Europäisches Patentamt
European Patent Office
Office européen des brevets




(11) Publication number: 0 574 257 A2



EUROPEAN PATENT APPLICATION 



(21) Application number: 93304500.7
(22) Date of filing: 10.06.93



(51) Int. Cl.5: C07K 15/00, C12N 15/12,
C12N 5/08, C12P 21/02,
C12P 21/08, G01N 33/48



(30) Priority: 10.06.92 US 896611
10.06.92 US 896612
10.06.92 US 896437

(43) Date of publication of application : 

15.12.93 Bulletin 93/50 

(84) Designated Contracting States : 

AT BE CH DE DK ES FR GB GR IE IT LI LU MC
NL PT SE

(71) Applicant: Kamboj, Rajender
2869 Arvida Circle
Mississauga, Ontario L5N 1R4 (CA)

(71) Applicant: Elliott, Candace
74 Burlington Street, Apt No. 1
Etobicoke, Ontario M8V 2L2 (CA)

(71) Applicant: Nutt, Stephen L
74 Burlington Street, Apt No. 1
Etobicoke, Ontario M8V 2L2 (CA)



(72) Inventor: Kamboj, Rajender
2869 Arvida Circle 
Mississauga, Ontario L5N 1R4 (CA) 
Inventor : Elliott, Candace 
74 Burlington Street, Apt No. 1 
Etobicoke, Ontario M8V 2L2 (CA) 
Inventor : Nutt, Stephen L 
74 Burlington Street, Apt No. 1 
Etobicoke, Ontario M8V 2L2 (CA) 

(74) Representative: Mercer, Christopher Paul
Carpmaels & Ransford 43, Bloomsbury Square 
London WC1A 2RA (GB) 



(54) Amino-hydroxy-methyl-isoxazole-propionate binding human glutamate receptors. 

(57) Described herein are isolated polynucleotides
which code for a family of AMPA-type human 
CNS receptors. The receptors are characterized 
structurally and the construction and use of cell 
lines expressing these receptors are disclosed. 






Jouve, 18, rue Saint-Denis, 75001 PARIS 



EP 0 574 257 A2 






Field of the Invention

This invention is concerned with applications of recombinant DNA technology in the field of neurobiology.
More particularly, the invention relates to the cloning and expression of DNA coding for excitatory amino acid
(EAA) receptors, especially human EAA receptors.

Background of the Invention 

In the mammalian central nervous system (CNS), the transmission of nerve impulses is controlled by the
interaction between a neurotransmitter substance released by the "sending" neuron and a surface receptor
on the "receiving" neuron, to which the neurotransmitter binds to cause excitation thereof. L-glutamate is
the most abundant neurotransmitter in the CNS, and mediates the major excitatory pathway in vertebrates.
Glutamate is therefore referred to as an excitatory amino acid (EAA) and the receptors which respond to it
are variously referred to as glutamate receptors, or more commonly as EAA receptors.

Using tissues isolated from mammalian brain, and various synthetic EAA receptor agonists, knowledge of
EAA receptor pharmacology has been refined somewhat. Members of the EAA receptor family are now grouped
into three main types based on differential binding to such agonists. One type of EAA receptor, which in addition
to glutamate also binds the agonist NMDA (N-methyl-D-aspartate), is referred to as the NMDA type of EAA
receptor. Two other glutamate-binding types of EAA receptor, which do not bind NMDA, are named according
to their preference for binding with two other EAA receptor agonists, namely AMPA
(α-amino-3-hydroxy-5-methyl-isoxazole-4-propionate) and kainate. Particularly, receptors which bind glutamate
but not NMDA, and which bind with greater affinity to kainate than to AMPA, are referred to as kainate type
EAA receptors. Similarly, those EAA receptors which bind glutamate but not NMDA, and which bind AMPA with
greater affinity than kainate, are referred to as AMPA type EAA receptors.
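
The grouping rule described above can be written as a small decision function. The following is only an illustrative sketch: the function name, its parameters and the idea of expressing affinity as a single relative score (larger meaning tighter binding) are assumptions made for the example and do not appear in the specification.

```python
def classify_eaa_receptor(binds_glutamate, binds_nmda, ampa_affinity, kainate_affinity):
    """Classify an EAA receptor by the agonist-binding rules stated in the text.

    Affinities are hypothetical relative scores (larger means tighter binding).
    """
    if not binds_glutamate:
        return "not an EAA receptor"
    if binds_nmda:
        return "NMDA type"
    if ampa_affinity > kainate_affinity:
        return "AMPA type"
    if kainate_affinity > ampa_affinity:
        return "kainate type"
    # Equal affinities: the text gives no rule for this case.
    return "unclassified"
```

For instance, a receptor that binds glutamate but not NMDA, with a higher score for AMPA than for kainate, is reported as "AMPA type".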

The glutamate-binding EAA receptor family is of great physiological and medical importance. Glutamate
is involved in many aspects of long-term potentiation (learning and memory), in the development of synaptic
plasticity, in epileptic seizures, in neuronal damage caused by ischemia following stroke or other hypoxic
events, as well as in other forms of neurodegenerative processes. However, the development of therapeutics
which modulate these processes has been very difficult, due to the lack of any homogeneous source of receptor
material with which to discover selectively binding drug molecules, which interact specifically at the
interface of the EAA receptor. The brain-derived tissues currently used to screen candidate drugs are
heterogeneous receptor sources, possessing on their surface many receptor types which interfere with studies of
the EAA receptor/ligand interface of interest. The search for human therapeutics is further complicated by the
limited availability of brain tissue of human origin. It would therefore be desirable to obtain cells that are
genetically engineered to produce only the receptor of interest. With cell lines expressing cloned receptor genes,
a substrate which is homogeneous for the desired receptor is provided for drug screening programs.

Recently, genes encoding substituent polypeptides of EAA receptors from non-human sources, principally
rat, have been discovered. Hollmann et al., Nature 342: 643, 1989 described the isolation from rat of a gene
referred to originally as GluR-K1 (but now called simply GluR1). This gene encodes a member of the rat EAA
receptor family, and was originally suspected as being of the kainate type. Subsequent studies by Keinanen
et al., Science 249: 556, 1990, showed, again in rat, that a gene called GluR-A, which was identical to the
previously isolated GluR1, in fact encodes a receptor not of the kainate type, but rather of the AMPA type. These
two groups of researchers have since reported as many as five related genes isolated from rat sources. Boulter
et al., Science 249: 1033, 1990, revealed that, in addition to GluR1, the rat contains 3 other related genes,
which they called GluR2, GluR3, and GluR4, and Bettler et al., Neuron 5: 583, 1990 described GluR5. Keinanen
et al., supra, described genes called GluR-A, GluR-B, GluR-C and GluR-D which correspond precisely to
GluR1, GluR2, GluR3 and GluR4 respectively. Sommer et al., Science 249: 1580, 1990 also showed, for GluR-A,
GluR-B, GluR-C and GluR-D, two alternatively spliced forms for each gene. These authors, as well as Monyer
et al., Neuron 6: 799, 1991, were able to show that the differently spliced versions of these genes are
differentially expressed in the rat brain.

There has emerged from these molecular cloning advances a better understanding of the structural features
of EAA receptors and their subunits, as they exist in the rat brain. According to the current model of EAA
receptor structure, each is heteromeric in structure, consisting of individual membrane-anchored subunits,
each having four transmembrane regions, and extracellular domains that dictate ligand binding properties to
some extent and contribute to the ion-gating function served by the receptor complex. Keinanen et al., supra,
have shown for example that each subunit of the rat GluR receptor, including those designated GluR-A, GluR-B,
GluR-C and GluR-D, displays cation channel activity gated by glutamate, by AMPA and by kainate, in its
unitary state. When expressed in combination however, for example GluR-A in combination with GluR-B, gated
ion channels with notably larger currents are produced by the host mammalian cells.

In the search for therapeutics useful to treat CNS disorders in humans, it is highly desirable of course to
provide a screen for candidate compounds that is more representative of the human situation than is possible
with the rat receptors isolated to date. It is particularly desirable to provide cloned genes coding for human
receptors, and cell lines expressing those genes, in order to generate a proper screen for human therapeutic
compounds. These, accordingly, are objects of the present invention.

Summary of the Invention 

The present invention provides isolated polynucleotides that code for a family of AMPA-binding human EAA
receptors, herein referred to as "GluR receptors". By providing polynucleotides that code specifically for CNS
receptors native to humans, the present invention provides means for evaluating the human nervous system,
and particularly for assessing potentially therapeutic interactions between the AMPA-binding human EAA
receptors and selected natural and synthetic ligands.

In one of its aspects, the present invention provides an isolated polynucleotide that codes for an EAA
receptor belonging to the human GluR family. Alternatively, the polynucleotide may code for an AMPA-binding
fragment of a human GluR receptor, or for an AMPA-binding variant of a human GluR receptor. According to
specific embodiments of the present invention, the isolated polynucleotide encodes the human GluR1B
receptor, the amino acid sequence of which is identified in Figure 1 (SEQ ID NO: 2), the human GluR2B receptor,
the amino acid sequence of which is identified in Figure 2 (SEQ ID NO: 4), and the human GluR3A receptor,
the amino acid sequence of which is identified in Figure 3 (SEQ ID NO: 6). According to another embodiment
of the invention, the polynucleotide encodes an AMPA-binding variant of the human GluR receptor. One such
variant is identified herein as the human GluR3B receptor, the amino acid sequence of which is identified in
Figure 4 (SEQ ID NO: 8). In various specific embodiments of the present invention, the polynucleotide consists
of DNA, e.g. cDNA, or of RNA, e.g. messenger RNA. In other embodiments of the present invention, the
polynucleotide may be coupled to a reporter molecule, such as a radioactive label, for use in autoradiographic
studies of human GluR receptor tissue distribution. In further embodiments of the present invention, fragments of
the polynucleotides of the invention, including radiolabeled versions thereof, may be employed either as
probes for detection of glutamate receptor-encoding polynucleotides, as primers appropriate for amplifying
such polynucleotides present in a biological specimen, or as templates for expression of a GluR receptor or
AMPA-binding fragments or variants thereof.

According to another aspect of the present invention, there is provided a cellular host that produces an
AMPA-type human glutamate receptor, and is characterized by the incorporation therein of a polynucleotide
of the present invention. In embodiments of the present invention, the polynucleotide is a DNA molecule and
is incorporated for expression and secretion in the cellular host, to yield, upon culturing, a functional,
membrane-bound human GluR receptor. In other embodiments of the present invention, the polynucleotide is an
RNA molecule which is introduced into the cellular host to yield a human GluR receptor as a functional,
membrane-bound product of translation.

According to another aspect of the invention, there is provided a process for obtaining a substantially
homogeneous source of a human EAA receptor useful for performing ligand binding assays, which comprises
the steps of culturing a genetically engineered cellular host of the invention, and then recovering the cultured
cells. Optionally, the cultured cells may be treated to obtain membrane preparations thereof, for use in the
ligand binding assays.

According to another aspect of the present invention, there is provided a method for assaying interaction
between a test ligand and a human EAA receptor, comprising the steps of incubating the test ligand under
appropriate conditions with a human GluR receptor source, i.e., a cellular host of the invention or a membrane
preparation derived therefrom, and then determining the extent or result of binding between the test ligand
and the receptor source.

These and other aspects of the invention are now described in greater detail with reference to the accom- 
panying drawings, in which: 

Brief Description of the Drawings

Figure 1 provides a DNA sequence coding for the human GluR1B receptor, and the amino acid sequence
thereof (SEQ ID NOS: 1 and 2);

Figure 2 provides a DNA sequence coding for the human GluR2B receptor, and the amino acid sequence
thereof (SEQ ID NOS: 3 and 4);

Figure 3 provides a DNA sequence coding for the human GluR3A receptor, and the amino acid sequence
thereof (SEQ ID NOS: 5 and 6);

Figure 4 provides a DNA sequence coding for the human GluR3B receptor, and the amino acid sequence
thereof (SEQ ID NOS: 7 and 8);

Figure 5 provides the amino acid sequence of the human GluR3A receptor (SEQ ID NO: 9) and the human
GluR3B receptor (SEQ ID NO: 10) in a region of dissimilarity;

Figure 6 depicts the strategy employed in cloning the human GluR3A receptor-encoding DNA illustrated
in Figure 3;

Figure 7 depicts the strategy employed in cloning the human GluR3B receptor-encoding DNA illustrated
in Figure 4;

Figure 8 depicts the strategy employed in generating recombinant DNA expression constructs incorporating
the GluR3A receptor-encoding DNA;

Figure 9 depicts the strategy employed in generating recombinant DNA expression constructs incorporating
the GluR1B receptor-encoding DNA (SEQ ID NOS: 11 and 12 are also shown in this figure);

Figure 10 depicts the strategy employed in cloning the human GluR2B receptor-encoding DNA illustrated
in Figure 2;

Figure 11 depicts the strategy employed in generating recombinant DNA expression constructs incorporating
the GluR2B receptor-encoding DNA;

Figure 12 illustrates the AMPA-binding property of the human GluR1B receptor;

Figure 13 illustrates the AMPA-binding property of the human GluR2B receptor;

Figure 14 illustrates the AMPA-binding property of the human GluR3A receptor;

Figures 15 & 16 illustrate a Scatchard analysis of human GluR1B and GluR2B receptor AMPA binding; and

Figure 17 graphically shows AMPA competition binding data for the GluR2B receptor.

Detailed Description of the Preferred Embodiments 

The invention relates to human CNS receptors of the AMPA-binding type, and is directed more particularly
to novel receptors belonging to a family herein referred to as "GluR receptors", and provides isolated
polynucleotides that code for such receptors. The term "isolated" is used herein with reference to intact
polynucleotides that are generally less than about 4,000 nucleotides in length and which are otherwise isolated
from DNA coding for other human proteins.

As used herein, the term "GluR receptors" is intended to embrace the human GluR1B, GluR2B and GluR3A
receptors, AMPA-binding variants related thereto, as well as AMPA-binding fragments of the GluR1B, GluR2B
and GluR3A receptors. Receptor variants within the scope of the present invention are functional variants of
a parent receptor, i.e., one of GluR1B, GluR2B, GluR3A and GluR3B, which include conservative amino acid
substitutions.

The term "AMPA-binding", as used herein with respect to receptors, and variants and fragments thereof,
refers to a ligand binding profile which reveals glutamate binding and relatively greater binding affinity for AMPA
than for either glutamate, kainate or NMDA, as determined using assays of conventional design, such as the
assays herein described.

In the present specification, an AMPA-binding receptor is said to be "functional" if a cellular host producing
it exhibits de novo channel activity when exposed appropriately to AMPA, as determined by the established
electrophysiological assays described for example by Hollmann et al., supra, or by any other assay appropriate
for detecting conductance across a cell membrane.

Members of the human GluR family of the invention possess structural features characteristic of the EAA 
receptors in general, including extracellular N- and C-terminal regions, as well as four internal hydrophobic do- 
mains which serve to anchor the receptor within the cell surface membrane. 

More specifically, the GluR1B receptor is a protein characterized structurally as a single polypeptide chain
that is produced initially in precursor form bearing an 18 amino acid residue N-terminal signal peptide, and is
transported to the cell surface in mature form, lacking the signal peptide and consisting of 888 amino acids
arranged in the sequence illustrated, by single letter code, in Figure 1 (SEQ ID NOS: 1 and 2). Unless otherwise
stated, the term human GluR receptor, either generally or with reference to a particular member of the receptor
family, refers to the mature form of the receptor. Thus, the amino acid residues of these receptors are
numbered in Figures 1-4 with reference to the mature protein sequence. With respect to structural domains of the
GluR1B receptor, hydropathy analysis reveals four putative transmembrane domains, one spanning residues
521-540 inclusive (TM-1), another spanning residues 567-585 (TM-2), a third spanning residues 596-614 (TM-3)
and the fourth spanning residues 788-808 (TM-4). Based on this assignment, it is likely that the human
GluR1B receptor structure, in its natural membrane-bound form, consists of a 520 amino acid N-terminal
extracellular domain, followed by a hydrophobic region containing four transmembrane domains and an
extracellular, 80 amino acid C-terminal domain.

The GluR2B receptor, in precursor form, bears a 21 amino acid residue N-terminal signal peptide, and in
mature form, consists of 862 amino acids arranged in the sequence illustrated, by single letter code, in Figure
2 (SEQ ID NOS: 3 and 4). With respect to structural domains of the receptor, hydropathy analysis reveals four
putative transmembrane domains, one spanning residues 525-544 inclusive (TM-1), another spanning residues
571-589 (TM-2), a third spanning residues 600-618 (TM-3) and the fourth spanning residues 792-812
(TM-4). Based on this assignment, it is likely that the human GluR2B receptor structure, in its natural
membrane-bound form, consists of a 524 amino acid N-terminal extracellular domain, followed by a hydrophobic
region containing four transmembrane domains and an extracellular, 50 amino acid C-terminal domain.

The GluR3A member of the human GluR family bears a 22 amino acid residue N-terminal signal peptide
in precursor form, and is transported to the cell surface in mature form, lacking the signal peptide and
consisting of 866 amino acids arranged in the sequence illustrated, by single letter code, in Figure 3 (SEQ ID NOS:
5 and 6). The four putative transmembrane domains of the GluR3A receptor are as follows: one spans residues
527-546 inclusive (TM-1), another spans residues 575-593 (TM-2), a third spans residues 604-622 (TM-3) and
the fourth spans residues 796-816 (TM-4). Based on this assignment, it is likely that the human GluR3A
receptor structure, in its natural membrane-bound form, consists of a 526 amino acid N-terminal extracellular
domain, followed by a hydrophobic region containing four transmembrane domains and an extracellular,
50 amino acid C-terminal domain.
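
The N-terminal and C-terminal domain lengths quoted for GluR1B, GluR2B and GluR3A follow directly from the mature lengths and the transmembrane boundaries given above. The short sketch below reproduces that arithmetic. The residue numbers are taken from the text; the class and function names are illustrative assumptions only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Receptor:
    name: str
    mature_length: int  # amino acids in the mature form (signal peptide removed)
    tm: tuple           # four (start, end) pairs, inclusive, mature numbering


RECEPTORS = [
    Receptor("GluR1B", 888, ((521, 540), (567, 585), (596, 614), (788, 808))),
    Receptor("GluR2B", 862, ((525, 544), (571, 589), (600, 618), (792, 812))),
    Receptor("GluR3A", 866, ((527, 546), (575, 593), (604, 622), (796, 816))),
]


def terminal_domains(receptor):
    """Return (N-terminal length, C-terminal length) flanking TM-1 and TM-4."""
    n_terminal = receptor.tm[0][0] - 1                       # residues before TM-1
    c_terminal = receptor.mature_length - receptor.tm[3][1]  # residues after TM-4
    return n_terminal, c_terminal


SUMMARY = {r.name: terminal_domains(r) for r in RECEPTORS}
# SUMMARY == {"GluR1B": (520, 80), "GluR2B": (524, 50), "GluR3A": (526, 50)}
```

The computed values agree with the 520/80, 524/50 and 526/50 amino acid domains stated in the three paragraphs above.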

Structurally related variants of the GluR parent receptors identified above also exist. Specifically, a
structurally related variant of the human GluR3A receptor, namely the GluR3B receptor, has also been identified.
This variant occurs naturally in human brain tissue, and like GluR3A, the GluR3B receptor is 866 amino acids
in length, as shown in Figure 4 (SEQ ID NOS: 7 and 8), in its mature, membrane-bound form. The GluR3B
receptor initially bears a signal peptide identical to that borne on the GluR3A receptor. Four transmembrane
domains are also apparent from the GluR3B sequence, and indicate that these domains lie in the same amino
acid regions identified in connection with the GluR3A receptor.

With respect to primary structure, the human GluR3B receptor differs from the GluR3A receptor in a 36
amino acid region separating transmembrane domains TM-3 and TM-4, i.e. residues 748-783. For comparison,
the sequences of GluR3A and GluR3B in this region are shown in Figure 5 (SEQ ID NOS: 9 and 10).
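
A region of dissimilarity between two equal-length sequences, such as the one described above, can be located by a position-by-position comparison. The following is a minimal sketch under stated assumptions: the function names are hypothetical and the two short sequences are invented toy strings, not the GluR3A or GluR3B sequences of Figures 3 to 5.

```python
def differing_span(seq_a, seq_b):
    """Return the 1-based inclusive (first, last) positions at which two
    equal-length sequences differ, or None if they are identical."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    diffs = [i for i, (a, b) in enumerate(zip(seq_a, seq_b), start=1) if a != b]
    if not diffs:
        return None
    return diffs[0], diffs[-1]


def span_length(first, last):
    """Number of residues in an inclusive span."""
    return last - first + 1


# Toy sequences for illustration only (not real receptor sequences).
TOY_A = "MKTLLAGGSW"
TOY_B = "MKTQRAGGSW"
TOY_SPAN = differing_span(TOY_A, TOY_B)  # (4, 5)
```

Applied to the span quoted in the text, `span_length(748, 783)` gives 36, consistent with the stated 36 amino acid region. Note that a span found this way is bounded by the first and last mismatches and may contain positions at which the two sequences agree.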

Binding assays performed with various ligands, and with membrane preparations derived from mammalian
cells engineered genetically to produce the human GluR receptors in membrane-bound form, indicate that the
human GluR receptors bind selectively to AMPA, relative particularly to kainate and NMDA. This feature,
coupled with the medically significant connection between AMPA-type receptors and neurological disorders and
disease, indicates that the present receptors, as well as AMPA-binding fragments and variants thereof, will serve
as valuable tools in the screening and discovery of ligands useful to modulate in vivo interactions between
such receptors and their natural ligand, glutamate. Thus, a key aspect of the present invention resides in the
construction of cells that are engineered genetically to produce a human GluR receptor, to serve as a ready
and homogeneous source of receptor for use in in vitro ligand binding and/or channel activation assays.

For use in the ligand binding assays, it is desirable to construct by application of genetic engineering
techniques a host cell, either prokaryotic or eukaryotic, that produces a human GluR receptor as a heterologous
and membrane-bound product. According to one embodiment of the invention, the construction of such
engineered cells is achieved by introducing into a selected host cell a recombinant DNA construct in which DNA
coding for a secretable form of the desired human GluR receptor, i.e., a form bearing its native signal peptide
or a functional, heterologous equivalent thereof, is linked operably with expression controlling elements that
are functional in the selected host to drive expression of the receptor-encoding DNA, and thus elaborate the
desired human GluR receptor protein. Such cells are herein characterized as having the receptor-encoding
DNA incorporated "expressibly" therein. The receptor-encoding DNA is referred to as "heterologous" with
respect to the particular cellular host if such DNA is not naturally found in the particular host. The particular cell
type selected to serve as host for production of the human GluR receptor can be any of several cell types
currently available in the art, including both prokaryotic and eukaryotic cells, but should not of course be a cell
type that in its natural state elaborates a surface receptor that can bind excitatory amino acids, and so confuse
the assay results sought from the engineered cell line. Generally, such problems are avoided by selecting as
host a non-neuronal cell type, and can further be avoided using non-human cell lines, as is conventional. It
will be appreciated that neuronal- and human-type cells may nevertheless serve as expression hosts, provided
that "background" binding to the test ligand is accounted for in the assay results.
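
One simple way of accounting for "background" binding, as mentioned above, is to subtract the signal measured with the unmodified host from the signal measured with the engineered host. The sketch below assumes exactly that subtraction model; the function name, the parameter names and the use of arbitrary signal units are hypothetical and are not taken from the specification.

```python
def specific_binding(engineered_signal, background_signal):
    """Binding attributable to the introduced receptor.

    engineered_signal: ligand binding measured with the engineered host
    background_signal: binding measured with the unmodified host
    Both are in the same arbitrary units (an assumption of this sketch).
    """
    if engineered_signal < 0 or background_signal < 0:
        raise ValueError("binding signals must be non-negative")
    # Clamp at zero so measurement noise cannot yield negative binding.
    return max(engineered_signal - background_signal, 0.0)
```

For example, an engineered-host signal of 1500 with a background of 300 leaves 1200 units attributable to the introduced receptor.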

According to one embodiment of the present invention, the cell line selected to serve as host for human
GluR receptor production is a mammalian cell. Several types of such cell lines are currently available for
genetic-engineering work, and these include the Chinese hamster ovary (CHO) cells, for example of K1 lineage
(ATCC CCL 61) including the Pro5 variant (ATCC CRL 1281); the fibroblast-like cells derived from
SV40-transformed African Green monkey kidney of the CV-1 lineage (ATCC CCL 70), of the COS-1 lineage (ATCC CRL
1650) and of the COS-7 lineage (ATCC CRL 1651); murine L-cells, murine 3T3 cells (ATCC CRL 1658), murine
C127 cells, human embryonic kidney cells of the 293 lineage (ATCC CRL 1573), human carcinoma cells
including those of the HeLa lineage (ATCC CCL 2), and neuroblastoma cells of the lines IMR-32 (ATCC CCL
127), SK-N-MC (ATCC HTB 10) and SK-N-SH (ATCC HTB 11).

A variety of gene expression systems have been adapted for use with these hosts and are now commercially
available, and any one of these systems can be selected to drive expression of human GluR receptor-encoding
DNA. These systems, available typically in the form of plasmidic vectors, incorporate expression
cassettes the functional components of which include DNA constituting expression controlling sequences, which
are host-recognized and enable expression of the receptor-encoding DNA when linked 5' thereof. The systems
further incorporate DNA sequences which terminate expression when linked 3' of the receptor-encoding
region. Thus, for expression in the selected mammalian cell host, there is generated a recombinant DNA
expression construct in which DNA coding for a secretable form of the receptor is linked with expression controlling
DNA sequences recognized by the host, and which include a region 5' of the receptor-encoding DNA to drive
expression, and a 3' region to terminate expression. The plasmidic vector harboring the recombinant DNA
expression construct typically incorporates such other functional components as an origin of replication, usually
virally-derived, to permit replication of the plasmid in the expression host and desirably also for plasmid
amplification in a bacterial host, such as E. coli. To provide a marker enabling selection of stably transformed
recombinant cells, the vector will also incorporate a gene conferring some survival advantage on the
transformants, such as a gene coding for neomycin resistance, in which case the transformants are plated in medium
supplemented with neomycin.

Included among the various recombinant DNA expression systems that can be used to achieve mammalian
cell expression of the receptor-encoding DNA are those that exploit promoters of viruses that infect mammalian
cells, such as the promoter from the cytomegalovirus (CMV), the Rous sarcoma virus (RSV), simian virus
(SV40), murine mammary tumor virus (MMTV) and others. Also useful to drive expression are promoters such
as the LTR of retroviruses, insect cell promoters such as those regulated by temperature, and isolated from
Drosophila, as well as mammalian gene promoters such as those regulated by heavy metals, i.e. the
metallothionein gene promoter, and other steroid-inducible promoters.

For incorporation into the recombinant DNA expression vector, DNA coding for a selected human GluR
receptor, e.g. one of the human GluR1B, GluR2B or GluR3A receptors, or an AMPA-binding fragment or variant
thereof, e.g. GluR3B, can be obtained by applying selected techniques of gene isolation or gene synthesis.
As described in more detail in the examples herein, human GluR receptors are encoded within the genome of
human brain tissue, and can therefore be obtained from human DNA libraries by careful application of
conventional gene isolation and cloning techniques. This typically will entail extraction of total messenger RNA from
a fresh source of human brain tissue, preferably cerebellum or hippocampus tissue, followed by conversion of
message to cDNA and formation of a library in for example a bacterial plasmid, more typically a bacteriophage.
Such bacteriophage harboring fragments of the human DNA are typically grown by plating on a lawn of
susceptible E. coli bacteria, such that individual phage plaques or colonies can be isolated. The DNA carried by
the phage colony is then typically immobilized on a nitrocellulose or nylon-based hybridization membrane, and
then hybridized, under carefully controlled conditions, to a radioactively (or otherwise) labelled oligonucleotide
probe of appropriate sequence to identify the particular phage colony carrying receptor-encoding DNA or
fragment thereof. It will be understood, for example, that selective hybridization, i.e. hybridization of a DNA
sequence that is completely complementary to the probe, will be conducted under stringent hybridization
conditions. Typically, the gene or a portion thereof so identified is subcloned into a plasmidic vector for nucleic acid
sequence analysis.

In specific embodiments of the invention, the GluR1B receptor is encoded by the DNA sequence illustrated
in Figure 1 (SEQ ID NO: 1), the GluR2B receptor is encoded by the DNA sequence illustrated in Figure 2 (SEQ
ID NO: 3) and the GluR3A and GluR3B receptors are encoded by the DNA sequences illustrated respectively
in Figures 3 (SEQ ID NO: 5) and 4 (SEQ ID NO: 7). Alternatively, codons within the illustrated DNA sequences
coding for the GluR receptors may be replaced by synonymous codon equivalents, such synonymous codon
replacements being well-known in the art.
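
Synonymous codon replacement, as referred to above, changes the DNA sequence without changing the encoded amino acid sequence. The sketch below illustrates this with the standard genetic code. The helper names and the 12-nucleotide toy sequence are assumptions made for illustration; the toy sequence is not taken from any of the figures, and the rule of picking the alphabetically first alternative codon is an arbitrary choice for the example.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codons(dna):
    """Split a coding sequence into codons; length must be a multiple of 3."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def translate(dna):
    """Translate a coding sequence into single-letter amino acid code."""
    return "".join(CODON_TABLE[c] for c in codons(dna))


def swap_synonymous(dna):
    """Replace each codon by a different synonymous codon where one exists."""
    out = []
    for codon in codons(dna):
        alternatives = sorted(
            c for c, aa in CODON_TABLE.items()
            if aa == CODON_TABLE[codon] and c != codon
        )
        out.append(alternatives[0] if alternatives else codon)
    return "".join(out)


TOY_DNA = "ATGGCTAAATGG"            # hypothetical toy sequence: M A K W
TOY_SWAPPED = swap_synonymous(TOY_DNA)
```

Here `TOY_SWAPPED` differs from `TOY_DNA` at the alanine and lysine codons, while methionine and tryptophan, which each have a single codon, are unchanged; both sequences translate to the same peptide.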
The illustrated DNA sequences constitute cDNA sequences identified in human brain cDNA libraries in the
manner exemplified herein. Having herein provided the nucleotide sequence of various members of the human
GluR receptor family, however, it will be appreciated that polynucleotides encoding the receptors can be
obtained by other routes. Automated techniques of gene synthesis and/or amplification can be performed to
generate DNA coding therefor. Because of the length of the human GluR receptor-encoding DNA, application of
automated synthesis may require staged gene construction, in which regions of the gene up to about 300
nucleotides in length are synthesized individually and then ligated in correct succession by overhang
complementarity for final assembly. Individually synthesized gene regions can be amplified prior to assembly, using
established polymerase chain reaction (PCR) technology.
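
The staged construction described above can be pictured as splitting the coding sequence into blocks of at most about 300 nucleotides and rejoining them in order. The sketch below is a deliberately crude model: adjacent blocks simply share a few nucleotides, which stands in for overhang complementarity, and no strand chemistry is modelled. The function names, the 4-nucleotide default overhang and the test sequence are all assumptions made for illustration.

```python
def split_into_blocks(seq, max_len=300, overhang=4):
    """Split seq into blocks of at most max_len nucleotides, where each block
    shares its first `overhang` nucleotides with the end of the previous one."""
    if not 0 <= overhang < max_len:
        raise ValueError("overhang must be non-negative and shorter than max_len")
    blocks = []
    start = 0
    step = max_len - overhang
    while True:
        end = min(start + max_len, len(seq))
        blocks.append(seq[start:end])
        if end == len(seq):
            break
        start += step
    return blocks


def reassemble(blocks, overhang=4):
    """Join blocks in order, checking that each junction matches."""
    seq = blocks[0]
    for block in blocks[1:]:
        if seq[len(seq) - overhang:] != block[:overhang]:
            raise ValueError("adjacent blocks do not share the expected overhang")
        seq += block[overhang:]
    return seq
```

With a 1,000-nucleotide input and the defaults shown, this yields four blocks, each no longer than 300 nucleotides, and `reassemble` recovers the input exactly.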

By the application of automated gene synthesis techniques, there is provided a means to generate
polynucleotides that encode variants of naturally occurring human GluR receptors, i.e. GluR1B, GluR2B, GluR3A
and GluR3B. It will be appreciated, for example, that polynucleotides coding for the human GluR receptors
herein described can be generated by substituting synonymous codons for those represented in the naturally
occurring polynucleotide sequences herein identified. In addition, polynucleotides coding for human GluR
receptor variants can be generated which for example incorporate one or more, e.g. 1-10, single amino acid
substitutions, deletions or additions. Since it will for the most part be desirable to retain the natural ligand binding
profile of the receptor for screening purposes, it is desirable to limit amino acid substitutions, for example to
the so-called conservative replacements in which amino acids of like charge are substituted, and to limit
substitutions to those sites less critical for receptor activity, e.g. within about the first 20 N-terminal residues of
the mature receptor, and such other regions as are elucidated upon receptor domain mapping.
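
The limits suggested above (a small number of substitutions, each replacing an amino acid with one of like charge) can be expressed as a simple check. This is only a sketch under stated assumptions: the charge grouping used here (lysine and arginine as positive, aspartate and glutamate as negative, everything else, including histidine, treated as uncharged) is a simplification chosen for the example, and the function names and toy peptides are hypothetical.

```python
POSITIVE = set("KR")   # lysine, arginine
NEGATIVE = set("DE")   # aspartate, glutamate


def charge_class(residue):
    """Crude charge class of a residue given in single-letter code."""
    if residue in POSITIVE:
        return "+"
    if residue in NEGATIVE:
        return "-"
    return "0"  # simplification: all other residues treated as uncharged


def substitutions(parent, variant):
    """List (position, parent residue, variant residue) for each mismatch."""
    if len(parent) != len(variant):
        raise ValueError("this sketch handles substitutions only, not indels")
    return [
        (i, p, v)
        for i, (p, v) in enumerate(zip(parent, variant), start=1)
        if p != v
    ]


def is_like_charge_variant(parent, variant, max_substitutions=10):
    """True if the variant has 1..max_substitutions substitutions, each of
    which preserves the charge class of the parent residue."""
    subs = substitutions(parent, variant)
    if not 1 <= len(subs) <= max_substitutions:
        return False
    return all(charge_class(p) == charge_class(v) for _, p, v in subs)
```

For the toy peptide "MKDLE", the variant "MRDLE" (lysine to arginine) passes the check, whereas "MEDLE" (lysine to glutamate) does not. The check says nothing about whether a substituted site is critical for receptor activity, which the text treats as a separate consideration.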

With appropriate template DNA in hand, the technique of PCR amplification may also be used to directly
generate all or part of the final gene. In this case, primers are synthesized which will prime the PCR
amplification of the final product, either in one piece, or in several pieces that may be ligated together. This may be
via step-wise ligation of blunt ended, amplified DNA fragments, or preferentially via step-wise ligation of
fragments containing naturally occurring restriction endonuclease sites. In this application, it is possible to use
either cDNA or genomic DNA as the template for the PCR amplification. In the former case, the cDNA template
can be obtained from commercially available or self-constructed cDNA libraries of various human brain tissues,
including hippocampus and cerebellum.

Once obtained, the receptor-encoding DNA is incorporated for expression into any suitable expression vector,
and host cells are transfected therewith using conventional procedures, such as DNA-mediated transformation,
electroporation, or particle gun transformation. Expression vectors may be selected to provide transformed
cell lines that express the receptor-encoding DNA either transiently or in a stable manner. For transient
expression, host cells are typically transformed with an expression vector harboring an origin of replication
functional in a mammalian cell. For stable expression, such replication origins are unnecessary, but the
vectors will typically harbour a gene coding for a product that confers on the transformants a survival
advantage, to enable their selection. Genes coding for such selectable markers include the E. coli gpt gene
which confers resistance to mycophenolic acid, the neo gene from transposon Tn5 which confers resistance to
the antibiotic G418 and to neomycin, the dhfr sequence from murine cells or E. coli which changes the
phenotype of DHFR- cells into DHFR+ cells, and the tk gene of herpes simplex virus, which makes TK- cells
phenotypically TK+ cells. Both transient expression and stable expression can provide transformed cell lines,
and membrane preparations derived therefrom, for use in ligand screening assays.

For use in screening assays, cells transiently expressing the receptor-encoding DNA can be stored frozen
for later use, but because the rapid rate of plasmid replication will lead ultimately to cell death, usually
in a few days, the transformed cells should be used as soon as possible. Such assays may be performed either
with intact cells, or with membrane preparations derived from such cells. The membrane preparations typically
provide a more convenient substrate for the ligand binding experiments, and are therefore preferred as binding
substrates. To prepare membrane preparations for screening purposes, i.e., ligand binding experiments, frozen
intact cells are homogenized while in cold water suspension and a membrane pellet is collected after
centrifugation. The pellet is then washed in cold water, and dialyzed to remove endogenous EAA ligands such as
glutamate, that would otherwise compete for binding in the assays. The dialyzed membranes may then be
used as such, or after storage in lyophilized form, in the ligand binding assays. Alternatively, intact, fresh
cells harvested about two days after transient transfection or after about the same period following fresh
plating of stably transfected cells, can be used for ligand binding assays by the same methods as used for
membrane preparations. When cells are used, the cells must be harvested by more gentle centrifugation so as
not to damage them, and all washing must be done in a buffered medium, for example in phosphate-buffered
saline, to avoid osmotic shock and rupture of the cells.

The binding of a substance, i.e., a candidate ligand, to a human GluR receptor of the invention is evaluated
typically using a predetermined amount of cell-derived membrane (measured for example by protein
determination), generally from about 25 µg to 100 µg. Generally, competitive binding assays will be useful to
evaluate the affinity of a test compound relative to AMPA. This competitive binding assay can be performed by
incubating the membrane preparation with radiolabelled AMPA, for example [3H]-AMPA, in the presence of
unlabelled test compound added at varying concentrations. Following incubation, either displaced or bound
radiolabelled AMPA can be recovered and measured, to determine the relative binding affinities of the test
compound and AMPA for the particular receptor used as substrate. In this way, the affinities of various
compounds for the AMPA-binding human EAA receptors can be measured. Alternatively, a radiolabelled analogue of
glutamate may be employed in place of radiolabelled AMPA, as competing ligand.
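
One common way to summarise such a competition experiment is to express bound radiolabel at each concentration of test compound as a percentage of the specific binding seen without competitor, and then to estimate the concentration giving 50% displacement. The following Python sketch shows that arithmetic. The IC50 read-out, the log-linear interpolation, and the function names are assumptions added for illustration, since the description does not prescribe a particular method of analysis.

```python
import math


def percent_of_control(bound_with_test, total_bound, nonspecific_bound):
    """Specific binding in the presence of test compound, as % of control."""
    return 100.0 * (bound_with_test - nonspecific_bound) / (
        total_bound - nonspecific_bound
    )


def ic50_by_interpolation(concentrations, percent_values):
    """Estimate the concentration giving 50% of control specific binding.

    concentrations must be ascending and positive; percent_values are the
    matching results of percent_of_control. Interpolation is linear in
    log10(concentration). Returns None if 50% is never crossed.
    """
    points = list(zip(concentrations, percent_values))
    for (c_lo, p_lo), (c_hi, p_hi) in zip(points, points[1:]):
        if p_lo >= 50.0 > p_hi:
            fraction = (p_lo - 50.0) / (p_lo - p_hi)
            log_ic50 = math.log10(c_lo) + fraction * (
                math.log10(c_hi) - math.log10(c_lo)
            )
            return 10 ** log_ic50
    return None
```

Comparing the value obtained for a test compound with that obtained for unlabelled AMPA under the same conditions gives the relative affinity referred to in the text.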

As an alternative to using cells that express receptor-encoding DNA, ligand characterization may also be
performed using cells, for example Xenopus oocytes, that yield functional membrane-bound receptor following
introduction by injection either of receptor-encoding messenger RNA into the oocyte cytoplasm, or of
receptor-encoding DNA into the oocyte nucleus. To generate the messenger RNA for cytoplasmic delivery, the
receptor-encoding DNA is typically subcloned first into a plasmidic vector adjacent a suitable promoter
region, such as the T3 or T7 bacteriophage promoters, to enable transcription into RNA message. RNA is then
transcribed from the inserted gene in vitro, collected and then injected into Xenopus oocytes. Following the
injection of nL volumes of an RNA solution, the oocytes are left to incubate for up to several days, and are
then tested for the ability to respond to a particular ligand molecule supplied in a bathing solution. Since
functional EAA receptors act in part by operating a membrane channel through which ions may selectively pass,
the functioning of the receptor in response to a particular ligand molecule in the bathing solution may
typically be measured as an electrical current utilizing microelectrodes inserted into the cell, in the
established manner.

In addition to using the receptor-encoding DNA to construct cell lines useful for ligand screening,
expression of the DNA can, according to another aspect of the invention, be performed to produce AMPA-binding
fragments of the receptor in soluble form, for structure investigation, to raise antibodies and for other
experimental uses. It is expected that the portion of the human GluR receptor responsible for AMPA-binding
resides on the outside of the cell, i.e., is extracellular. It is therefore desirable in the first instance to
facilitate the characterization of the receptor-ligand interaction by providing this extracellular
ligand-binding domain in quantity and in isolated form, i.e., free from the remainder of the receptor. To
accomplish this, the full-length human GluR receptor-encoding DNA may be modified by site-directed
mutagenesis, so as to introduce a translational stop codon into the extracellular N-terminal region,
immediately before the sequence encoding the first transmembrane domain (TM1), i.e., before residue 521 of
GluR1B, before residue 525 in GluR2B, or before residue 527 of GluR3A and GluR3B. Since there will no longer
be produced any transmembrane domain(s) to "anchor" the receptor into the membrane, expression of the modified
gene will result in the secretion, in soluble form, of only the extracellular ligand-binding domain. Standard
ligand-binding assays may then be performed to ascertain the degree of binding of a candidate compound to the
extracellular domain so produced. It may of course be necessary, using site-directed mutagenesis, to produce
several different versions of the extracellular regions, in order to optimize the degree of ligand binding to
the isolated domains.
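
The effect of placing a stop codon immediately before TM1 can be mimicked on a sequence string. The sketch below is a rough illustration under several assumptions: the input is taken to be a coding sequence whose first codon encodes residue 1 in the numbering used above (how that numbering relates to any signal peptide is not addressed here), the stop codon chosen is TGA, and the function name is hypothetical. The residue positions are those given in the text.

```python
# First TM1 residue for each receptor, as stated in the description.
TM1_START = {"GluR1B": 521, "GluR2B": 525, "GluR3A": 527, "GluR3B": 527}


def stop_before_tm1(coding_dna, receptor, stop_codon="TGA"):
    """Return coding_dna with the codon for the first TM1 residue replaced
    by a stop codon, so that translation ends after the preceding residue.

    Assumes codon 1 of coding_dna corresponds to residue 1 of the numbering
    used for TM1_START (an assumption of this sketch).
    """
    residue = TM1_START[receptor]
    index = 3 * (residue - 1)  # 0-based position of that residue's codon
    if len(coding_dna) < index + 3:
        raise ValueError("sequence is too short to contain the TM1 codon")
    return coding_dna[:index] + stop_codon + coding_dna[index + 3:]
```

For GluR1B the replaced codon starts at nucleotide index 1560 (0-based), so the truncated product would comprise residues 1-520 under the stated assumption.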

For use in ligand binding assays according to the present invention, AMPA-binding fragments of the
receptor will first be anchored to a solid support using any one of various techniques. In one method, the
C-terminal end of the receptor peptide fragment may be coupled to a derivatized, insoluble polymeric support,
for example, cross-linked polystyrene or polyamide resin. Once anchored to the solid support, the fragment is
useful to screen candidate ligands for receptor binding affinity. For this purpose, competition-type
ligand-binding assays, as described above using full-length receptor, are commonly used. Fragments secured to
a solid support are bound with a natural ligand, i.e. AMPA, in the presence of a candidate ligand. One of AMPA
or candidate ligand is labelled, for example radioactively, and following a suitable incubation period, the
degree of AMPA displacement is determined by measuring the amount of bound or unbound label.

Alternatively, it may be desirable to produce an extracellular domain of the receptor which is not derived
from the amino-terminus of the mature protein, but rather from the carboxy-terminus instead, for example
domains immediately following the fourth transmembrane domain (TM4), i.e., residing between amino acid
residues 809-888 of GluR1B, residues 813-862 of GluR2B, or residues 817-866 of GluR3A or GluR3B. In this case,
site-directed mutagenesis and/or PCR-based amplification techniques may readily be used to provide a defined
fragment of the gene encoding the receptor domain of interest. Such a DNA sequence may be used to direct the
expression of the desired receptor fragment, either intracellularly, or in secreted fashion, provided that
the DNA encoding the gene fragment is inserted adjacent to a translation start codon provided by the
expression vector, and that the required translation reading frame is carefully conserved.

It will be appreciated that the production of such AMPA-binding fragments of a GluR receptor may be
accomplished in a variety of host cells. Mammalian cells such as CHO cells may be used for this purpose, the
expression typically being driven by an expression promoter capable of high-level expression, for example the
CMV (cytomegalovirus) promoter. Alternately, non-mammalian cells, such as insect Sf9 (Spodoptera frugiperda)
cells may be used, with the expression typically being driven by expression promoters of the baculovirus,
for example the strong, late polyhedrin protein promoter. Filamentous fungal expression systems may also be
used to secrete large quantities of such extracellular domains of the EAA receptor. Aspergillus nidulans, for
example, with the expression being driven by the alcA promoter, would constitute such an acceptable system.
In addition to such expression hosts, it will be further appreciated that any prokaryotic or other eukaryotic
expression system capable of expressing heterologous genes or gene fragments, whether intracellularly or
extracellularly, would be similarly acceptable.

For use particularly in detecting the presence and/or location of a human GluR receptor, for example in
brain tissue, the present invention also provides, in another of its aspects, labelled antibody to a human
GluR receptor. To raise such antibodies, there may be used as immunogen either the intact, soluble receptor or
an immunogenic fragment thereof, i.e. a fragment capable of eliciting an immune response, produced in a
microbial or mammalian cell host as described above or by standard peptide synthesis techniques. Regions of
human GluR receptor particularly suitable for use as immunogenic fragments include those corresponding in
sequence to an extracellular region of the receptor, or a portion of the extracellular region. For example,
peptides consisting of residues 1-526 of the GluR3A receptor or a fragment thereof comprising at least about
10 residues, including particularly fragments containing residues 178-193 or 479-522; and peptides
corresponding to the region between transmembrane domains TM-2 and TM-3 of the GluR3A receptor, such as a
peptide consisting of residues 594-603. Peptides consisting of the C-terminal domain (residues 817-866 of the
GluR3A receptor), or fragment thereof, may also be used for the raising of antibodies.

The raising of antibodies to the selected human GluR receptor or immunogenic fragment can be achieved,
for polyclonal antibody production, using immunization protocols of conventional design, and any of a variety
of mammalian hosts, such as sheep, goats and rabbits. Alternatively, for monoclonal antibody production,
immunocytes such as splenocytes can be recovered from the immunized animal and fused, using hybridoma
technology, to myeloma cells. The fusion products are then screened by culturing in a selection medium, and
cells producing antibody are recovered for continuous growth, and antibody recovery. Recovered antibody can
then be coupled covalently to a detectable label, such as a radiolabel, enzyme label, luminescent label or the
like, using linker technology established for this purpose.

In detectably labelled form, e.g. radiolabeled form, DNA or RNA coding for a human GluR receptor, and
selected regions thereof, may also be used, in accordance with another aspect of the present invention, as
hybridization probes, for example to identify sequence-related genes resident in the human or other mammalian
genomes (or cDNA libraries) or to locate the human GluR-encoding DNA in a specimen, such as brain tissue.
This can be done using either the intact coding region, or a fragment thereof having radiolabeled, e.g. 32P,
nucleotides incorporated therein. To identify the human GluR-encoding DNA in a specimen, it is desirable to
use either the full length cDNA coding therefor, or a fragment which is unique thereto. With reference to
Figures 1-4 (SEQ ID NOS: 1-8), such nucleotide fragments include those comprising at least about 17 nucleic
acids, and otherwise corresponding in sequence to a region coding for an extracellular N-terminal or
C-terminal region of the receptor, or representing a 5'-untranslated or 3'-untranslated region thereof. Such
oligonucleotide sequences, and the intact gene itself, may also be used of course to clone human GluR-related
human genes, particularly cDNA equivalents thereof, by standard hybridization techniques.

Embodiments of the present invention are described in detail in the following specific examples which are 
not to be construed as limiting: 

Example 1 - Isolation of DNA coding for the human GluR3A receptor 

The particular strategy used to clone the human GluR3A receptor is depicted schematically in Figure 6, 
and described in greater detail below. 

cDNA coding for the human GluR3A receptor was identified by probing human hippocampal cDNA that
was obtained as an EcoRI-based lambda phage library (lambda ZAP) from Stratagene Cloning Systems (La
Jolla, California, U.S.A.). The cDNA library was probed initially with a 1.1kb EcoRI/EcoRI DNA fragment
constituting the 3' region of a kainate-binding human EAA receptor, designated humEAA1a. This particular
kainate-binding receptor is described in EP-A-0 529 994 incorporated herein by reference. DNA coding for the
human EAA1a receptor, and from which the 1.1kb probe may be recovered, was deposited under terms of the
Budapest Treaty, with the American Type Culture Collection in Rockville, Maryland U.S.A. on August 21, 1991
under accession number ATCC 75063.

Hybridizations using the probe were carried out at 30°C overnight, and filters were washed with 2xSSC
containing 0.5% SDS at 25°C for 5 minutes, followed by a 15 minute wash at 50°C with 2xSSC containing 0.5%
SDS. The final wash was with 1xSSC containing 0.5% SDS at 50°C for 15 minutes. Filters were exposed to
X-ray film (Kodak) overnight. Of 10^6 clones screened under the following hybridization conditions (6xSSC,
50% formamide, 5% Denhardt's solution, 0.5% SDS, 100 µg/ml denatured salmon sperm DNA), only two hippocampal
cDNA library inserts were identified, one about 1.6kb and designated RKCH521 and another about 2.2kb
and designated RKCH221 (Fig. 6). For sequencing, the '521 and the '221 phages were plaque purified, then
excised as phagemids according to the supplier's specifications, to generate insert-carrying Bluescript-SK
variants of the phagemid vector. Sequencing of the '221 clone across its entire sequence revealed a putative
ATG initiation codon together with about 78 bases of 5' non-coding region and about 2.1kb of coding region.
Sequencing across the '521 insert revealed a significant region of overlap with the '221 insert, and provided
some additional 3' sequence, although no termination codon was located.
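
The wash series above lowers the salt concentration while raising the temperature. As a rough guide to what the SSC multiples mean, the following Python sketch converts them to sodium ion molarity, assuming the conventional recipe in which 1xSSC is 0.15 M sodium chloride with 0.015 M trisodium citrate. That recipe and the function name are assumptions of the sketch; the source gives only the SSC multiples, temperatures and times.

```python
# Conventional 1xSSC composition (assumed; not stated in the description).
NACL_M_PER_1X = 0.150
CITRATE_M_PER_1X = 0.015  # trisodium citrate contributes three Na+ per mole


def sodium_molarity(ssc_fold):
    """Approximate Na+ concentration (mol/L) of an SSC solution."""
    return ssc_fold * (NACL_M_PER_1X + 3 * CITRATE_M_PER_1X)


# Wash steps as described: (SSC multiple, temperature in degrees C, minutes).
WASHES = [(2, 25, 5), (2, 50, 15), (1, 50, 15)]

for fold, temp_c, minutes in WASHES:
    print(f"{fold}xSSC, {temp_c} C, {minutes} min: "
          f"about {sodium_molarity(fold):.3f} M Na+")
```

Under this assumption the final 1xSSC wash contains about half the sodium of the 2xSSC washes, which is consistent with it being the most stringent step of the series.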

There being no termination codon apparent in the '521 sequence, a 3' region of the gene was sought. For
this purpose, there was first synthesized an oligonucleotide probe capable of annealing to the 3' region of
the rat GluR3 receptor sequence reported by Keinanen et al, supra. The specific sequence of the 32-P-labelled
probe is provided below (SEQ ID NO: 13):

5'-ACACTCAGAATTACGCTACATACAGAGAAGGCTACAACGT-3'

The same hippocampal cDNA library was then re-screened using the rat-based probe and under the following
hybridization conditions: 6xSSC, 25% formamide, 5% Denhardt's solution, 0.5% SDS, 100 µg/ml denatured salmon
sperm DNA, 42°C. This revealed a 1.2kb insert, designated RKCSHG132. Sequencing of the entire insert revealed
5' overlap with the 3' end of the previously isolated '521 insert, and also revealed a termination codon as
well as about 15 bases of 3' non-translated sequence.

To provide the entire coding region in an intact clone, the strategy shown in Figure 6 was employed, to
generate the phagemid pBS/humGluR3A which carries the hGluR3A-encoding DNA as a 2.8kb EcoRI/EcoRI
insert in a 3.0kb Bluescript-SK phagemid background. The entire sequence of the EcoRI/EcoRI insert is
provided in Figure 3 (SEQ ID NOS: 5 and 6).

The 5.8kb phagemid pBS/humGluR3A was deposited, under the terms of the Budapest Treaty, with the 
American Type Culture Collection in Rockville, Maryland USA on March 19, 1992, and has been assigned ac- 
cession number ATCC 75218. 

Example 2 - Isolation of DNA coding for human GluR3B receptor

A human fetal brain cDNA library was also screened in the search for human GluR receptors. This
particular library was obtained as an EcoRI-based lambda gt10 library from Stratagene Cloning Systems (La
Jolla, California, U.S.A.). The library was first screened using as hybridization probe an oligonucleotide
capable of hybridizing to a 3' region of the reported rat GluR3 gene sequence. Screening using hybridization
conditions as noted above (6xSSC, 25% formamide, 42°C, etc.) revealed one insert about 2.3kb in size,
designated RKCSFG34. After excision to release Bluescript-SK phagemids carrying the insert, sequencing
revealed substantial sequence identity between the '34 insert and the 3' end of the earlier isolated GluR3A
clone, and suggested that the 5' end of the gene encoded partially on the '34 insert was missing. To provide
an assembled gene, a 5' region was excised from the GluR3A insert and used to generate the 5' end of the '34
insert, at an internal HindIII site. This was achieved as depicted schematically in Figure 7. The resulting
intact clone was designated human GluR3B.

Sequence comparison between the GluR3A clone of Example 1 and the GluR3B clone of this Example
revealed only a short region of dissimilarity which is illustrated, in terms of amino acid sequence, in
Figure 5 (SEQ ID NOS: 9 and 10).

The 6.1 kb phagemid pBS/humGluR3B was deposited, under the terms of the Budapest Treaty, with the 
American Type Culture Collection in Rockville, Maryland USA on March 19, 1992, and has been assigned ac- 
cession number ATCC 75219. 

Example 3 - Isolation of DNA coding for the human GluR1B receptor

cDNA coding for the human GluR1B receptor was identified by probing human fetal brain cDNA that was
obtained as an EcoRI-based lambda phage library (lambda ZAP) from Stratagene Cloning Systems (La Jolla,
California, U.S.A.). The cDNA library was screened using an oligonucleotide probe capable of annealing to the
5' region of the rat GluR1 receptor sequence reported by Hollmann et al, supra. The specific sequence of the
32-P-labelled probe is provided below (SEQ ID NO: 14):

5'-CCAGATCGATATTGTGAACATCAGCGACACGTTTGAGATG-3'

The fetal brain cDNA library was screened under the following hybridization conditions: 6xSSC, 25%
formamide, 5% Denhardt's solution, 0.5% SDS, 100 µg/ml denatured salmon sperm DNA, 42°C. Filters were
washed with 2xSSC containing 0.5% SDS at 25°C for 5 minutes, followed by a 15 minute wash at 50°C with 2xSSC
containing 0.5% SDS. The final wash was with 1xSSC containing 0.5% SDS at 50°C for 15 minutes. Filters were
exposed to X-ray film (Kodak) overnight. Of 10^6 clones screened, only one cDNA insert, of about 3.2kb, was
identified, and designated RKCSFG91. For sequencing, the '91 phage was plaque purified, then excised as
a phagemid according to the supplier's specifications, to generate an insert-carrying Bluescript-SK variant of
the phagemid vector. Sequencing of the '91 clone across its entire sequence revealed a putative ATG initiation
codon together with about 61 bases of 5' non-coding region and 2,718 bases of coding region. Also revealed
was a termination codon, as well as about 438 bases of 3' non-translated sequence. The entire sequence of
the EcoRI/EcoRI insert is provided in Figure 1 (SEQ ID NOS: 1 and 2).

A 6.2kb phagemid designated pBS/humGluR1B, carrying the receptor-encoding DNA as a 3.2kb
EcoRI/EcoRI insert in a 3.0kb Bluescript-SK phagemid background, was deposited, under the terms of the
Budapest Treaty, with the American Type Culture Collection in Rockville, Maryland USA on May 28, 1992, and
has been assigned accession number ATCC 75246.

Example 4 - Isolation of DNA coding for the human GluR2B receptor 

The particular strategy used to clone the human GluR2B receptor is depicted schematically in Figure 10,
and described in greater detail below.

cDNA coding for the human GluR2B receptor was identified by probing human hippocampal cDNA that
was obtained as an EcoRI-based lambda phage library (lambda ZAP) from Stratagene Cloning Systems (La
Jolla, California, U.S.A.). The cDNA library was screened using an oligonucleotide probe capable of annealing
to the 3' region of the rat GluR2 receptor sequence reported by Keinanen et al, supra. The specific sequence
of the 32-P-labelled probe is provided below (SEQ ID NO: 15):

5'-GTGAATGTGGAGCCAAGGACTCGGGAAGTAAG-3'
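
Basic properties of the three oligonucleotide probes (SEQ ID NOS: 13-15) can be checked with a few lines of Python. The sequences below are transcribed from the Examples with spacing removed. The helper names are hypothetical, and the 17-nucleotide figure is the approximate minimum fragment length mentioned earlier in the description, used here only as a sanity check.

```python
# Probe sequences as printed in Examples 1, 3 and 4 (5' to 3').
PROBES = {
    "SEQ ID NO: 13": "ACACTCAGAATTACGCTACATACAGAGAAGGCTACAACGT",
    "SEQ ID NO: 14": "CCAGATCGATATTGTGAACATCAGCGACACGTTTGAGATG",
    "SEQ ID NO: 15": "GTGAATGTGGAGCCAAGGACTCGGGAAGTAAG",
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")
MIN_PROBE_LENGTH = 17  # "at least about 17" per the description


def reverse_complement(seq):
    """Return the strand that would anneal to seq, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Fraction of G and C residues in seq."""
    return (seq.count("G") + seq.count("C")) / len(seq)


for name, seq in PROBES.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.1%}, "
          f"long enough: {len(seq) >= MIN_PROBE_LENGTH}")
```

The reverse complement is included because a probe written in one orientation anneals to the opposite strand of its target, which is worth keeping in mind when comparing a probe against a published sequence.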

The hippocampal cDNA library was screened under the following hybridization conditions: 6xSSC, 25%
formamide, 5% Denhardt's solution, 0.5% SDS, 100 µg/ml denatured salmon sperm DNA, 42°C. Filters were
washed with 2xSSC containing 0.5% SDS at 25°C for 5 minutes, followed by a 15 minute wash at 50°C with
2xSSC containing 0.5% SDS. The final wash was with 1xSSC containing 0.5% SDS at 50°C for 15 minutes.
Filters were exposed to X-ray film (Kodak) overnight. Of 10^6 clones screened, only two cDNA inserts were
identified, one about 2.7kb and designated RKCSHG84 and another about 2.9kb and designated RKCSHG41
(Fig. 10). For sequencing, the '84 and the '41 phages were plaque purified, then excised as phagemids
according to the supplier's specifications, to generate insert-carrying Bluescript-SK variants of the phagemid
vector. Sequencing of the '84 clone across its entire sequence revealed a putative ATG initiation codon
together with about 314 bases of 5' non-coding region and about 2.4kb of coding region. Sequencing across the
'41 insert revealed a significant region of overlap with the '84 insert, and also revealed a termination
codon not found in the '84 insert as well as about 441 bases of 3' non-translated sequence.

To provide the entire coding region in an intact clone, the strategy shown in Figure 10 was employed, to
generate the phagemid pBS/humGluR2B which carries the hGluR2B-encoding DNA as a 3.4kb EcoRI/PstI
insert in a 3.0kb Bluescript-SK phagemid background. The entire sequence of the EcoRI/PstI insert is provided
in Figure 2 (SEQ ID NOS: 3 and 4).

The 6.4kb phagemid pBS/humGluR2B was deposited, under the terms of the Budapest Treaty, with the 
American Type Culture Collection in Rockville, Maryland USA on March 19, 1992, and has been assigned ac- 
cession number ATCC 75217. 
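
The sizes quoted for the deposited phagemids can be cross-checked by simple addition of insert and vector. The Python sketch below does this for the three constructs whose insert sizes are stated in Examples 1, 3 and 4. The 100 bp tolerance and the function name are assumptions, and the insert size shown for pBS/humGluR3B is only what would follow if it sits in the same 3.0kb background, which the text does not state.

```python
VECTOR_BP = 3000  # Bluescript-SK phagemid background, as quoted (3.0kb)

# (insert size, total phagemid size) in base pairs, from Examples 1, 3 and 4.
CONSTRUCTS = {
    "pBS/humGluR3A": (2800, 5800),
    "pBS/humGluR1B": (3200, 6200),
    "pBS/humGluR2B": (3400, 6400),
}


def sizes_agree(insert_bp, total_bp, vector_bp=VECTOR_BP, tolerance_bp=100):
    """True if insert + vector matches the quoted total within tolerance."""
    return abs(insert_bp + vector_bp - total_bp) <= tolerance_bp


for name, (insert_bp, total_bp) in CONSTRUCTS.items():
    print(name, "consistent:", sizes_agree(insert_bp, total_bp))

# Implied, not stated: insert size for the 6.1kb pBS/humGluR3B phagemid,
# assuming the same 3.0kb vector background.
implied_glur3b_insert_bp = 6100 - VECTOR_BP
print("pBS/humGluR3B implied insert:", implied_glur3b_insert_bp, "bp")
```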

Example 5 - Construction of genetically engineered cells producing human GluR3A receptors

The strategy depicted in Figure 8 was employed to facilitate incorporation of the GluR3A receptor-encoding 
cDNA into an expression vector. 

For transient expression in mammalian cells, cDNA coding for the human GluR3A receptor was incorporated
into the mammalian expression vector pcDNAI, which is available commercially from Invitrogen Corporation
(San Diego, California, USA; catalogue number V490-20). This is a multifunctional 4.2kb plasmid vector
designed for cDNA expression in eukaryotic systems, and cDNA analysis in prokaryotes. Incorporated on the
vector are the CMV promoter and enhancer, splice segment and polyadenylation signal, an SV40 and Polyoma
virus origin of replication, an M13 origin to rescue single strand DNA for sequencing and mutagenesis, Sp6
and T7 RNA promoters for the production of sense and anti-sense RNA transcripts and a Col E1-like high copy
plasmid origin. A polylinker is located appropriately downstream of the CMV promoter (and 3' of the T7
promoter).

To facilitate incorporation of the GluR3A receptor-encoding cDNA into an expression vector, a NotI site was
introduced onto the 5' flank of the Bluescript-SK cDNA insert, and the cDNA insert was then released from
pBS/humGluR3A as a 2.8kb NotI/NotI fragment, which was then incorporated at the NotI site in the pcDNAI
polylinker. Sequencing across the NotI junction was performed, to confirm proper insert orientation in pcDNAI.
The resulting plasmid, designated pcDNAI/humGluR3A, was then introduced for transient expression into a
selected mammalian cell host, in this case the monkey-derived, fibroblast-like cells of the COS-1 lineage
(available from the American Type Culture Collection, Rockville, Maryland as ATCC CRL 1650).

For transient expression of the GluR3A-encoding DNA, COS-1 cells were transfected with approximately
8 µg DNA (as pcDNAI/humGluR3A) per 10^6 COS cells, by DEAE-mediated DNA transfection and treated with
chloroquine according to the procedures described by Maniatis et al, supra. Briefly, COS-1 cells were plated
at a density of 5 x 10^6 cells/dish and then grown for 24 hours in FBS-supplemented DMEM/F12 medium. Medium
was then removed and cells were washed in PBS and then in medium. There was then applied on the
cells 10ml of a transfection solution containing DEAE dextran (0.4mg/ml), 100 µM chloroquine, 10% NuSerum,
DNA (0.4mg/ml) in DMEM/F12 medium. After incubation for 3 hours at 37°C, cells were washed in PBS and
medium as just described and then shocked for 1 minute with 10% DMSO in DMEM/F12 medium. Cells were
allowed to grow for 2-3 days in 10% FBS-supplemented medium, and at the end of incubation dishes were
placed on ice, washed with ice cold PBS and then removed by scraping. Cells were then harvested by
centrifugation at 1000 rpm for 10 minutes and the cellular pellet was frozen in liquid nitrogen, for
subsequent use in ligand binding assays. Northern blot analysis of a thawed aliquot of frozen cells confirmed
expression of receptor-encoding cDNA in cells under storage.

In a like manner, stably transfected cell lines can also be prepared using two different cell types as host:
CHO K1 and CHO Pro5. To construct these cell lines, cDNA coding for human GluR3A was incorporated into the
mammalian expression vector pRC/CMV (Invitrogen), which enables stable expression. Insertion at this site
placed the cDNA under the expression control of the cytomegalovirus promoter and upstream of the
polyadenylation site and terminator of the bovine growth hormone gene, and into a vector background comprising
the neomycin resistance gene (driven by the SV40 early promoter) as selectable marker.

To introduce plasmids constructed as described above, the host CHO cells are first seeded at a density
of 5 x 10^5 in 10% FBS-supplemented MEM medium. After growth for 24 hours, fresh medium is added to the
plates and three hours later, the cells are transfected using the calcium phosphate-DNA co-precipitation
procedure (Maniatis et al, supra). Briefly, 3 µg of DNA is mixed and incubated with buffered calcium solution
for 10 minutes at room temperature. An equal volume of buffered phosphate solution is added and the suspension
is incubated for 15 minutes at room temperature. Next, the incubated suspension is applied to the cells for 4
hours, removed, and cells are shocked with medium containing 15% glycerol. Three minutes later, cells are
washed with medium and incubated for 24 hours at normal growth conditions. Cells resistant to neomycin are
selected in 10% FBS-supplemented alpha-MEM medium containing G418 (1mg/ml). Individual colonies of
G418-resistant cells are isolated about 2-3 weeks later, clonally selected and then propagated for assay
purposes.

Example 6 - Construction of genetically engineered cells producing human GluR1B receptors

The strategy depicted in Figure 9 was employed to facilitate incorporation of the GluR1B receptor-encoding
cDNA into an expression vector. Particularly, a NotI site was introduced onto the 3' flank of the Bluescript-SK
cDNA insert, and the cDNA insert was then released from pBS/humGluR1B as a 3.2kb NotI/NotI fragment,
which was then incorporated at the NotI site in the pcDNAI polylinker. Sequencing across the junctions was
performed, to confirm proper insert orientation in pcDNAI. The resulting plasmid, designated
pcDNAI/humGluR1B, was then introduced for transient expression into monkey-derived, fibroblast-like cells of
the COS-1 lineage as described above.

For transient expression of the GluR1B-encoding DNA, COS-1 cells were transfected with approximately
8 µg DNA (as pcDNAI/humGluR1B) per 10^6 COS cells using the method described in Example 5.

Example 7 - Construction of genetically engineered cells producing human GluR2B receptors

The strategy depicted in Figure 11 was employed to facilitate incorporation of the GluR2B
receptor-encoding cDNA into an expression vector. Particularly, a NotI site was introduced onto the 5' flank
of the Bluescript-SK cDNA insert, and the cDNA insert was then released from pBS/humGluR2B as a 3.4kb
HindIII/NotI fragment, which was then incorporated at the HindIII/NotI sites in the pcDNAI polylinker.
Sequencing across the junctions was performed, to confirm proper insert orientation in pcDNAI. The resulting
plasmid, designated pcDNAI/humGluR2B, was then introduced for transient expression into a selected mammalian
cell host, in this case the monkey-derived, fibroblast-like cells of the COS-1 lineage (available from the
American Type Culture Collection, Rockville, Maryland as ATCC CRL 1650).

For transient expression of the GluR2B-encoding DNA, COS-1 cells were transfected with approximately
8 µg DNA (as pcDNAI/humGluR2B) per 10^6 COS cells as set out in Example 5.

Example 8 - Ligand binding assays

Transfected cells in the frozen state were resuspended in ice-cold distilled water using a hand
homogenizer, sonicated for 5 seconds, and then centrifuged for 20 minutes at 50,000g. The supernatant was
discarded and the membrane pellet stored frozen at -70°C.

COS cell membrane pellets were suspended in ice-cold 50mM Tris-HCl (pH 7.55, 5°C) and centrifuged again
at 50,000g for 10 minutes in order to remove endogenous glutamate that would compete for binding. Pellets
were resuspended in ice-cold 50mM Tris-HCl (pH 7.55) buffer and the resultant membrane preparation was
used as tissue source for the binding experiments described below. Protein was determined using the Pierce
Reagent with BSA as standard.

Binding assays were then performed, using an amount of COS-derived membrane equivalent to 25-100µg of
protein, as judged by protein determination, and a selected radiolabeled ligand. In particular, for AMPA-binding
assays, incubation mixtures consisted of 25-100µg tissue protein and
D,L-alpha-[5-methyl-3H]amino-3-hydroxy-5-methylisoxazole-4-propionic acid ([3H]-AMPA, 27.6Ci/mmole, 10nM final)
with 0.1M KSCN and 2.5mM CaCl2 in a 1ml final volume. Non-specific binding was determined in the presence
of 1mM L-glutamate. Samples were incubated on ice for 60 minutes in plastic minivials, and bound and free
ligand were separated by centrifugation for 30 minutes at 50,000g. Pellets were washed twice in 6ml of the
cold incubation buffer, then 5ml of Beckman Ready-Protein Plus scintillation cocktail was added for counting.
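
The arithmetic behind a specific-binding figure in fmol/mg protein can be sketched as follows. This is an illustrative sketch only, not the procedure recorded in the application: it assumes counts have already been converted to dpm, uses the definition 1 Ci = 2.22 x 10^12 dpm, and the function names and example counts are hypothetical. Only the specific activity (27.6 Ci/mmole) and the 25-100µg protein range come from the text.

```python
DPM_PER_CI = 2.22e12  # disintegrations per minute in one curie (by definition)


def dpm_to_fmol(dpm, specific_activity_ci_per_mmol):
    """Convert radioactivity (dpm) into femtomoles of labelled ligand."""
    mmol = dpm / (DPM_PER_CI * specific_activity_ci_per_mmol)
    return mmol * 1e12  # 1 mmol = 1e12 fmol


def specific_binding_fmol_per_mg(total_dpm, nonspecific_dpm, protein_ug,
                                 specific_activity_ci_per_mmol):
    """Specific binding = total binding minus binding that persists in the
    presence of excess unlabelled competitor (here, 1mM L-glutamate),
    normalised to the amount of membrane protein in the tube."""
    specific_dpm = total_dpm - nonspecific_dpm
    fmol = dpm_to_fmol(specific_dpm, specific_activity_ci_per_mmol)
    return fmol / (protein_ug / 1000.0)  # µg -> mg


if __name__ == "__main__":
    # Hypothetical counts for one tube containing 50 µg of membrane protein.
    value = specific_binding_fmol_per_mg(
        total_dpm=5000, nonspecific_dpm=2000, protein_ug=50,
        specific_activity_ci_per_mmol=27.6)
    print(f"specific binding: {value:.1f} fmol/mg protein")
```

The same calculation applies to the kainate assay if the specific activity of the kainate ligand is substituted.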

For kainate-binding assays, incubation mixtures consisted of 25-100µg tissue protein and [vinylidene-3H]
kainic acid (58Ci/mmole, 5nM final) in the cold incubation buffer, 1ml final volume. Non-specific binding was
determined in the presence of 1mM L-glutamate. Samples were incubated as for the AMPA-binding assays,
and bound and free ligand were separated by rapid filtration using a Brandel cell harvester and GF/B filters
pre-soaked in ice-cold 0.3% polyethyleneimine. Filters were washed twice in 6ml of the cold incubation buffer,
then placed in scintillation vials with 5ml of Beckman Ready-Protein Plus scintillation cocktail for counting.

Assays performed in this manner, using membrane preparations derived from the human GluR3A receptor-producing
COS cells, revealed specific binding of 25-30 fmol/mg protein at 10nM [3H]-AMPA (Figure 14); using
membrane preparations derived from the human GluR1B receptor-producing COS cells, specific binding of
about 100-150 fmol/mg protein at 10nM [3H]-AMPA was revealed (Figure 12); and using membrane preparations
derived from the human GluR2B receptor-producing COS cells, specific binding of 750-850 fmol/mg protein
at 10nM [3H]-AMPA was revealed (Figure 13). Mock transfected cells exhibited no specific binding of any
of the ligands tested.

Scatchard analysis indicated that the recombinantly expressed human GluR1B and GluR2B receptors
each contain a single class of [3H]-labelled AMPA binding sites, with dissociation constants (Kd) of 46 nM
(Figure 15) and about 36.3 ± 7.4 nM (Figure 16), respectively. Further, the maximum AMPA-binding of the GluR1B
and GluR2B receptors has been found to be 847 and 816 ± 302 fmol/mg protein, respectively.
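
For readers unfamiliar with Scatchard analysis, the following sketch shows how Kd and maximum binding (Bmax) are read from a straight-line fit of bound/free against bound for a single class of sites, where B/F = (Bmax - B)/Kd. It is a minimal illustration, not the analysis actually run for Figures 15 and 16: the function name is hypothetical, the free-ligand concentrations are invented, and the data are generated from the reported GluR2B values purely to show that the fit recovers them.

```python
def scatchard_fit(bound, free):
    """Ordinary least-squares line through the points (B, B/F).

    For a single class of binding sites the slope is -1/Kd and the
    x-intercept is Bmax. Returns (Kd, Bmax) in the units of the inputs.
    """
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = intercept * kd  # equals -intercept / slope, the x-intercept
    return kd, bmax


if __name__ == "__main__":
    # Synthetic, noise-free saturation data (hypothetical concentrations, nM)
    # generated from the single-site model B = Bmax * F / (Kd + F).
    true_kd, true_bmax = 36.3, 816.0
    free_nm = [5, 10, 20, 40, 80, 160]
    bound = [true_bmax * f / (true_kd + f) for f in free_nm]
    kd, bmax = scatchard_fit(bound, free_nm)
    print(f"Kd = {kd:.1f} nM, Bmax = {bmax:.0f} fmol/mg protein")
```

With real data the points scatter about the line, and curvature in the plot would indicate more than one class of sites.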

[3H]-AMPA displacement assays have also been performed for the GluR2B receptors in COS cells to determine
the relative binding affinity of selected ligands. These results, as illustrated in Figure 17, indicate the
rank order of potency of the ligands in displacing [3H]-AMPA binding to the GluR2B receptor to be as follows:
quisqualate = AMPA > DNQX > CNQX > glutamate > domoate > kainate
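
Displacement data of this kind are commonly converted from IC50 values to inhibition constants with the Cheng-Prusoff relation, Ki = IC50 / (1 + [L]/Kd). The sketch below is an assumption about how such a conversion could be done; the application does not state that it was applied here. The radioligand concentration (10nM) and the GluR2B Kd (36.3 nM) are taken from the text, while the function name and the IC50 values are hypothetical placeholders, not measured results.

```python
def cheng_prusoff_ki(ic50_nm, radioligand_nm, kd_nm):
    """Inhibition constant from an IC50 under competitive binding."""
    return ic50_nm / (1.0 + radioligand_nm / kd_nm)


def rank_by_potency(ic50_by_ligand, radioligand_nm, kd_nm):
    """Return (ligand, Ki) pairs, most potent (lowest Ki) first."""
    kis = {name: cheng_prusoff_ki(ic50, radioligand_nm, kd_nm)
           for name, ic50 in ic50_by_ligand.items()}
    return sorted(kis.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical IC50 values (nM), for illustration only.
    example_ic50 = {"ligand A": 100.0, "ligand B": 2500.0, "ligand C": 40.0}
    for name, ki in rank_by_potency(example_ic50, radioligand_nm=10.0, kd_nm=36.3):
        print(f"{name}: Ki = {ki:.1f} nM")
```

Because every ligand is tested against the same radioligand concentration, the correction factor is common to all of them and the rank order by Ki is the same as the rank order by IC50.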
These results demonstrate clearly that the human GluR receptors bind AMPA with specificity. This activity,
coupled with the fact that there is little or no demonstrable binding of either kainate or NMDA, clearly assigns
the human GluR receptors to the AMPA type of EAA receptor. Furthermore, this binding profile indicates
that the receptor is binding in an authentic manner, and can therefore reliably predict the ligand binding
"signature" of its non-recombinant counterpart from the human brain. These features make the recombinant
receptor especially useful for selecting and characterizing ligand compounds which bind to the receptor, and/or
for selecting and characterizing compounds which may act by displacing other ligands from the receptor. The
isolation of the GluR receptor genes in substantially pure form, capable of being expressed as a single,
homogeneous receptor species, therefore frees the ligand binding assay from the lack of precision introduced
when complex, heterogeneous receptor preparations from human and other mammalian brains are used to
attempt such characterizations.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: ALLELIX BIOPHARMACEUTICALS

(B) STREET: 6850 Goreway Drive

(C) CITY: Mississauga 

(D) STATE OR PROVINCE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE: L4V 1P1

(G) TELEPHONE: (416) 677-0831

(H) FAX: (416) 677-9595

(ii) TITLE OF INVENTION: AMPA-BINDING HUMAN GLUTAMATE RECEPTORS
(iii) NUMBER OF SEQUENCES: 15

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(v) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: Unknown 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/896,437 

(B) FILING DATE: 10-JUN-1992 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/896,611 

(B) FILING DATE: 10-JUN-1992 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/896,612 

(B) FILING DATE: 10-JUN-1992 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 62..2782

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 62..115

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 116..2782

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CAATTCCACA CCAAATCTAT GATTCGACCT GGCCTTCTTT TTCGCCAATG CAAAAAGGAA 60 

T ATG CAG CAC ATT TTT GCC TTC TTC TGC ACC GGT TTC CTA GGC GCG 106
Met Gln His Ile Phe Ala Phe Phe Cys Thr Gly Phe Leu Gly Ala
-18 -15 -10 -5

GTA GTA GGT GCC AAT TTC CCC AAC AAT ATC CAG ATC CGG GGA TTA TTT 154
Val Val Gly Ala Asn Phe Pro Asn Asn Ile Gln Ile Gly Gly Leu Phe
1 5 10

CCA AAC CAG CAG TCA CAG CAA CAT GCT GCT TTT AGA TTT GCT TTG TCG 202
Pro Asn Gln Gln Ser Gln Glu His Ala Ala Phe Arg Phe Ala Leu Ser
15 20 25

CAA CTC ACA GAG CCC CCG AAG CTG CTC CCC CAG ATT CAT ATT GTG AAC 250
Gln Leu Thr Glu Pro Pro Lys Leu Leu Pro Gln Ile Asp Ile Val Asn
30 35 40 45

ATC AGC GAC ACG TTT GAG ATG ACC TAT AGA TTC TGT TCC CAG TTC TCC 298
Ile Ser Asp Thr Phe Glu Met Thr Tyr Arg Phe Cys Ser Gln Phe Ser
50 55 60

AAA GGA GTC TAT GCC ATC TTT GGG TTT TAT GAA CGT AGG ACT GTC AAC 346
Lys Gly Val Tyr Ala Ile Phe Gly Phe Tyr Glu Arg Arg Thr Val Asn
65 70 75

ATG CTG ACC TCC TTT TGT GGG GCC CTC CAC GTC TGC TTC ATT ACG CCG 394
Met Leu Thr Ser Phe Cys Gly Ala Leu His Val Cys Phe Ile Thr Pro
80 85 90

AGC TTT CCC GTT CAT ACA TCC AAT CAG TTT GTC CTT CAG CTG CCC CCT 442
Ser Phe Pro Val Asp Thr Ser Asn Gln Phe Val Leu Gln Leu Arg Pro
95 100 105

GAA CTG CAG GAT GCC CTC ATC AGC ATC ATT GAC CAT TAC AAG TGC CAG 490
Glu Leu Gln Asp Ala Leu Ile Ser Ile Ile Asp His Tyr Lys Trp Gln
110 115 120 125

AAA TTT GTC TAC ATT TAT GAT GCC GAC CGG GGC TTA TCC GTC CTG CAG 538
Lys Phe Val Tyr Ile Tyr Asp Ala Asp Arg Gly Leu Ser Val Leu Gln
130 135 140

AAA GTC CTG GAT ACA GCT GCT GAG AAG AAC TGG CAG GTG ACA GGA GTC 586
Lys Val Leu Asp Thr Ala Ala Glu Lys Asn Trp Gln Val Thr Ala Val
145 150 155

AAC ATT TTG ACA ACC ACA GAG GAG GGA TAC CGG ATG CTC TTT CAG CAC 634
Asn Ile Leu Thr Thr Thr Glu Glu Gly Tyr Arg Met Leu Phe Gln Asp
160 165 170

CTG CAG AAC AAA AAC GAG CGG CTG GTG GTG GTG GAC TGT GAA TCA GAA 682
Leu Glu Lys Lys Lys Glu Arg Leu Val Val Val Asp Cys Glu Ser Glu
175 180 185

CGC CTC AAT CCT ATC TTG GGC CAC ATT ATA AAG CTA GAG AAG AAT GCC 730
Arg Leu Asn Ala Ile Leu Gly Gln Ile Ile Lys Leu Glu Lys Asn Gly
190 195 200 205

ATC GGC TAC CAC TAC ATT CTT GCA AAT CTG GGC TTC ATG GAC ATT GAC 778
Ile Gly Tyr His Tyr Ile Leu Ala Asn Leu Gly Phe Met Asp Ile Asp
210 215 220

TTA AAC AAA TTC AAG GAG AGT GGC GCC AAT GTG ACA GGT TTC CAG CTG 826
Leu Asn Lys Phe Lys Glu Ser Gly Ala Asn Val Thr Gly Phe Gln Leu
225 230 235

CTG AAC TAC ACA GAC ACT ATT CCG GCC AAG ATC ATG CAG CAG TGG AAG 874
Val Asn Tyr Thr Asp Thr Ile Pro Ala Lys Ile Met Gln Gln Trp Lys
240 245 250

AAT AGT GAT GCT CGA GAC CAC ACA CGG GTG GAC TGG AAG AGA CCC AAG 922
Asn Ser Asp Ala Arg Asp His Thr Arg Val Asp Trp Lys Arg Pro Lys
255 260 265

TAC ACC TCT GCG CTC ACC TAC GAT GGG GTG AAG GTG ATG GCT GAG GCT 970
Tyr Thr Ser Ala Leu Thr Tyr Asp Gly Val Lys Val Met Ala Glu Ala
270 275 280 285

TTC CAG AGC CTG CGG AGG CAG AGA ATT CAT ATA TCT CGC CGG GGG AAT 1018
Phe Gln Ser Leu Arg Arg Gln Arg Ile Asp Ile Ser Arg Arg Gly Asn
290 295 300

GCT GGG GAT TGT CTG GCT AAC CCA GCT GTT CCC TGG GCC CAA GGG ATC 1066
Ala Gly Asp Cys Leu Ala Asn Pro Ala Val Pro Trp Gly Gln Gly Ile
305 310 315

GAC ATC CAG AGA GCT CTG CAG CAG GTG CGA TTT GAA GGT TTA ACA GGA 1114
Asp Ile Gln Arg Ala Leu Gln Gln Val Arg Phe Glu Gly Leu Thr Gly
320 325 330

AAC GTG CAG TTT AAT GAG AAA GGA CGC CGC ACC AAC TAC ACG CTC CAC 1162
Asn Val Gln Phe Asn Glu Lys Gly Arg Arg Thr Asn Tyr Thr Leu His
335 340 345

GTG ATT GAA ATG AAA CAT GAC GGC ATC CGA AAG ATT GGT TAC TGG AAT 1210
Val Ile Glu Met Lys His Asp Gly Ile Arg Lys Ile Gly Tyr Trp Asn
350 355 360 365

GAA GAT GAT AAG TTT GTC CCT CCA GCC ACC GAT GCC CAA GCT GGG GCC 1258
Glu Asp Asp Lys Phe Val Pro Ala Ala Thr Asp Ala Gln Ala Gly Gly
370 375 380

GAT AAT TCA ACT GTT CAG AAC AGA ACA TAC ATC GTC ACA ACA ATC CTA 1306
Asp Asn Ser Ser Val Gln Asn Arg Thr Tyr Ile Val Thr Thr Ile Leu
385 390 395

GAA GAT CCT TAT GTG ATG CTC AAG AAG AAC GCC AAT CAG TTT GAG GGC 1354
Glu Asp Pro Tyr Val Met Leu Lys Lys Asn Ala Asn Gln Phe Glu Gly
400 405 410

AAT GAC CGT TAC GAG GGC TAC TGT GTA GAG CTG GCG GCA GAG ATT GCC 1402
Asn Asp Arg Tyr Glu Gly Tyr Cys Val Glu Leu Ala Ala Glu Ile Ala
415 420 425

AAG CAC GTG GCC TAC TCC TAC CGT CTG GAG ATT GTC ACT CAT GGA AAA 1450
Lys His Val Gly Tyr Ser Tyr Arg Leu Glu Ile Val Ser Asp Gly Lys
430 435 440 445

TAC GGA GCC CGA GAC CCT CAC ACG AAG GCC TGG AAT GGC ATG GTG GCA 1498
Tyr Gly Ala Arg Asp Pro Asp Thr Lys Ala Trp Asn Gly Met Val Gly
450 455 460

GAG CTG GTC TAT GGA AGA GCA GAT GTG GCT GTG GCT CCC TTA ACT ATC 1546
Glu Leu Val Tyr Gly Arg Ala Asp Val Ala Val Ala Pro Leu Thr Ile
465 470 475

ACT TTC CTC CGC GAA GAA GTT ATA GAT TTC TCC AAA CCA TTT ATG ACT 1594
Thr Leu Val Arg Glu Glu Val Ile Asp Phe Ser Lys Pro Phe Met Ser
480 485 490

TTG GGG ATC TCC ATC ATG ATT AAA AAA CCA CAG AAA TCC AAG CCG GGT 1642
Leu Gly Ile Ser Ile Met Ile Lys Lys Pro Gln Lys Ser Lys Pro Gly
495 500 505

GTC TTC TCC TTC CTT CAT CCT TTG GCT TAT GAG ATT TGG ATG TGC ATT 1690
Val Phe Ser Phe Leu Asp Pro Leu Ala Tyr Glu Ile Trp Met Cys Ile
510 515 520 525

GTT TTT GCC TAC ATT GGA GTG AGT GTT GTC CTC TTC CTG GTC AGC CGC 1738
Val Phe Ala Tyr Ile Gly Val Ser Val Val Leu Phe Leu Val Ser Arg
530 535 540

TTC AGT CCC TAT GAA TGG CAC AGT GAA GAG TTT GAG GAA GGA CGG GAC 1786
Phe Ser Pro Tyr Glu Trp His Ser Glu Glu Phe Glu Glu Gly Arg Asp
545 550 555

CAG ACA ACC AGT GAC CAG TCC AAT GAG TTT GGG ATA TTC AAC AGT TTG 1834
Gln Thr Thr Ser Asp Gln Ser Asn Glu Phe Gly Ile Phe Asn Ser Leu
560 565 570

TGG TTC TCC CTG GGA GCC TTC ATG CAG GAA GGA TGT GAC ATT TCT CCC 1882
Trp Phe Ser Leu Gly Ala Phe Met Gln Gln Gly Cys Asp Ile Ser Pro
575 580 585

AGG TCC CTG TCT GGT CGC ATC GTT GGT GGC CTC TGC TGG TTC TTC ACC 1930
Arg Ser Leu Ser Gly Arg Ile Val Gly Gly Val Trp Trp Phe Phe Thr
590 595 600 605

TTA ATC ATC ATC TCC TCA TAT ACA GCC AAT CTG GCC GCC TTC CTG ACC 1978
Leu Ile Ile Ile Ser Ser Tyr Thr Ala Asn Leu Ala Ala Phe Leu Thr
610 615 620

GTG GAG AGG ATG GTG TCT CCC ATT GAG AGT GCA GAG GAC CTA GCG AAC 2026
Val Glu Arg Met Val Ser Pro Ile Glu Ser Ala Glu Asp Leu Ala Asn
625 630 635

GAG ACA GAA ATT GCC TAC GGG ACC CTG GAA GCA GGA TCT ACT AAG GAG 2074
Glu Thr Glu Ile Ala Tyr Gly Thr Leu Glu Ala Gly Ser Thr Lys Glu
640 645 650

TTC TTC AGG AGG TCT AAA ATT GCT GTG TTT GAG AAG ATG TGG ACA TAC 2122
Phe Phe Arg Arg Ser Lys Ile Ala Val Phe Glu Lys Met Trp Thr Tyr
655 660 665

ATG AAG TCA GCA GAG CCA TCA GTT TTT GTG CGG ACC ACA GAG GAG GGG 2170
Met Lys Ser Ala Glu Pro Ser Val Phe Val Arg Thr Thr Glu Glu Gly
670 675 680 685

ATG ATT CGA GTG AGG AAA TCC AAA GGC AAA TAT GCC TAC CTC CTG GAG 2218
Met Ile Arg Val Arg Lys Ser Lys Gly Lys Tyr Ala Tyr Leu Leu Glu
690 695 700

TCC ACC ATG AAT GAG TAC ATT GAG CAG CGG AAA CCC TGT GAC ACC ATG 2266
Ser Thr Met Asn Glu Tyr Ile Glu Gln Arg Lys Pro Cys Asp Thr Met
705 710 715

AAG GTG GGA GGT AAC TTG GAT TCC AAA GGC TAT GGC ATT GCA ACA CCC 2314
Lys Val Gly Gly Asn Leu Asp Ser Lys Gly Tyr Gly Ile Ala Thr Pro
720 725 730

AAG GGG TCT GCC CTG AGA GGT CCC GTA AAC CTA GCG GTT TTG AAA CTC 2362
Lys Gly Ser Ala Leu Arg Gly Pro Val Asn Leu Ala Val Leu Lys Leu
735 740 745

AGT GAG GAA GGC GTC TTA GAC AAG CTG AAA AGC AAA TGG TGG TAC GAT 2410
Ser Glu Gln Gly Val Leu Asp Lys Leu Lys Ser Lys Trp Trp Tyr Asp
750 755 760 765

AAA GGG GAA TGT GGA AGC AAG GAC TCC GGA AGT AAG GAC AAG ACA AGC 2458
Lys Gly Glu Cys Gly Ser Lys Asp Ser Gly Ser Lys Asp Lys Thr Ser
770 775 780

GCT CTG ACC CTC AGC AAT GTG GCA GGC CTG TTC TAC ATC CTG ATC GGA 2506
Ala Leu Ser Leu Ser Asn Val Ala Gly Val Phe Tyr Ile Leu Ile Gly
785 790 795

GGA CTT GGA CTA GCC ATG CTG GTT GCC TTA ATC GAG TTC TGC TAC AAA 2554
Gly Leu Gly Leu Ala Met Leu Val Ala Leu Ile Glu Phe Cys Tyr Lys
800 805 810

TCC CGT AGT GAA TCC AAG CGG ATG AAG GGT TTT TGT TTC ATC CCA CAG 2602
Ser Arg Ser Glu Ser Lys Arg Met Lys Gly Phe Cys Leu Ile Pro Gln
815 820 825

CAA TCC ATC AAC GAA GCC ATA CGG ACA TCG ACC CTC CCC CGC AAC AGC 2650
Gln Ser Ile Asn Glu Ala Ile Arg Thr Ser Thr Leu Pro Arg Asn Ser
830 835 840 845

CGG GCA GGA GCC AGC AGC GGC GGC AGT GGA GAG AAT GGT CGG CTG GTC 2698
Gly Ala Gly Ala Ser Ser Gly Gly Ser Gly Glu Asn Gly Arg Val Val
850 855 860

AGC CAT GAC TTC CCC AAG TCC ATG CAA TCC ATT CCT TGC ATG AGC CAC 2746
Ser His Asp Phe Pro Lys Ser Met Gln Ser Ile Pro Cys Met Ser His
865 870 875

AGT TCA GGG ATG CCC TTG GGA GCC ACG GGA TTG TAACTGGACC AGATGGAGAC 2799
Ser Ser Gly Met Pro Leu Gly Ala Thr Gly Leu
880 885



CCCTTGGGGA GCAGGCTCGG GCTCCCCAGC CCCATCCCAA ACCCTTCAGT GCCAAAAACA 2859

ACAACAAAAT AGAAACCGCA ACCACCACCA ACCACTGCGA CCACAAGAAG GATGATTCAA 2919

CAGGTTTTCC TGAAGAATTG AAAAACCATT TTGCTGTCCC TTTTCCTTTT TTGATGTTCT 2979

TTCACCCTTT TCTGTTTGCT AAGTGAGGAT GAAAAAATAA CACTGTACTG CAATAAGGGG 3039

AGAGTAACCC TGTCTAATGA AACCTGTGTC TCTGAGAGTA GAGTCACTCG AACACTAATG 3099

AGGAAACTGC ACTGTTTTAT TTTAATTCAG TTGTTAGTGT GTCTTAGTGT GTGCAATTTT 3159

TTTTCTTACT AATATCCATG GTTTGCAGGT TCTGTTAGGC CCTTTCCTTC TCCTGGAATT 3219

C 3220
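
As a consistency check on the listing, the nucleotide and amino acid lines can be compared by translating the DNA with the standard genetic code. The sketch below is illustrative only and is not part of the filed listing; the function name is hypothetical. It translates the first fifteen codons of the coding region of SEQ ID NO:1 (nucleotides 62 to 106) as printed above.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(dna):
    """Translate a DNA string (spaces ignored) codon by codon.

    Returns one-letter amino acid codes, with '*' for a stop codon.
    Any trailing partial codon is ignored.
    """
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    start_of_cds = "ATG CAG CAC ATT TTT GCC TTC TTC TGC ACC GGT TTC CTA GGC GCG"
    # Met Gln His Ile Phe Ala Phe Phe Cys Thr Gly Phe Leu Gly Ala
    print(translate(start_of_cds))  # MQHIFAFFCTGFLGA
```

Running the same comparison over a whole sequence flags any codon whose printed nucleotides do not encode the residue shown beneath it.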



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 906 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gln His Ile Phe Ala Phe Phe Cys Thr Gly Phe Leu Gly Ala Val
-18 -15 -10 -5

Val Gly Ala Asn Phe Pro Asn Asn Ile Gln Ile Gly Gly Leu Phe Pro
1 5 10

Asn Gln Gln Ser Gln Glu His Ala Ala Phe Arg Phe Ala Leu Ser Gln
15 20 25 30

Leu Thr Glu Pro Pro Lys Leu Leu Pro Gln Ile Asp Ile Val Asn Ile
35 40 45

Ser Asp Thr Phe Glu Met Thr Tyr Arg Phe Cys Ser Gln Phe Ser Lys
50 55 60

Gly Val Tyr Ala Ile Phe Gly Phe Tyr Glu Arg Arg Thr Val Asn Met
65 70 75

Leu Thr Ser Phe Cys Gly Ala Leu His Val Cys Phe Ile Thr Pro Ser
80 85 90

Phe Pro Val Asp Thr Ser Asn Gln Phe Val Leu Gln Leu Arg Pro Glu
95 100 105 110

Leu Gln Asp Ala Leu Ile Ser Ile Ile Asp His Tyr Lys Trp Gln Lys
115 120 125

Phe Val Tyr Ile Tyr Asp Ala Asp Arg Gly Leu Ser Val Leu Gln Lys
130 135 140

Val Leu Asp Thr Ala Ala Glu Lys Asn Trp Gln Val Thr Ala Val Asn
145 150 155

Ile Leu Thr Thr Thr Glu Glu Gly Tyr Arg Met Leu Phe Gln Asp Leu
160 165 170

Glu Lys Lys Lys Glu Arg Leu Val Val Val Asp Cys Glu Ser Glu Arg
175 180 185 190

Leu Asn Ala Ile Leu Gly Gln Ile Ile Lys Leu Glu Lys Asn Gly Ile
195 200 205

Gly Tyr His Tyr Ile Leu Ala Asn Leu Gly Phe Met Asp Ile Asp Leu
210 215 220

Asn Lys Phe Lys Glu Ser Gly Ala Asn Val Thr Gly Phe Gln Leu Val
225 230 235

Asn Tyr Thr Asp Thr Ile Pro Ala Lys Ile Met Gln Gln Trp Lys Asn
240 245 250

Ser Asp Ala Arg Asp His Thr Arg Val Asp Trp Lys Arg Pro Lys Tyr
255 260 265 270

Thr Ser Ala Leu Thr Tyr Asp Gly Val Lys Val Met Ala Glu Ala Phe
275 280 285

Gln Ser Leu Arg Arg Gln Arg Ile Asp Ile Ser Arg Arg Gly Asn Ala
290 295 300

Gly Asp Cys Leu Ala Asn Pro Ala Val Pro Trp Gly Gln Gly Ile Asp
305 310 315

Ile Gln Arg Ala Leu Gln Gln Val Arg Phe Glu Gly Leu Thr Gly Asn
320 325 330

Val Gln Phe Asn Glu Lys Gly Arg Arg Thr Asn Tyr Thr Leu His Val
335 340 345 350

Ile Glu Met Lys His Asp Gly Ile Arg Lys Ile Gly Tyr Trp Asn Glu
355 360 365

Asp Asp Lys Phe Val Pro Ala Ala Thr Asp Ala Gln Ala Gly Gly Asp
370 375 380

Asn Ser Ser Val Gln Asn Arg Thr Tyr Ile Val Thr Thr Ile Leu Glu
385 390 395

Asp Pro Tyr Val Met Leu Lys Lys Asn Ala Asn Gln Phe Glu Gly Asn
400 405 410

Asp Arg Tyr Glu Gly Tyr Cys Val Glu Leu Ala Ala Glu Ile Ala Lys
415 420 425 430

His Val Gly Tyr Ser Tyr Arg Leu Glu Ile Val Ser Asp Gly Lys Tyr
435 440 445

Gly Ala Arg Asp Pro Asp Thr Lys Ala Trp Asn Gly Met Val Gly Glu
450 455 460

Leu Val Tyr Gly Arg Ala Asp Val Ala Val Ala Pro Leu Thr Ile Thr
465 470 475

Leu Val Arg Glu Glu Val Ile Asp Phe Ser Lys Pro Phe Met Ser Leu
480 485 490

Gly Ile Ser Ile Met Ile Lys Lys Pro Gln Lys Ser Lys Pro Gly Val
495 500 505 510

Phe Ser Phe Leu Asp Pro Leu Ala Tyr Glu Ile Trp Met Cys Ile Val
515 520 525

Phe Ala Tyr Ile Gly Val Ser Val Val Leu Phe Leu Val Ser Arg Phe
530 535 540

Ser Pro Tyr Glu Trp His Ser Glu Glu Phe Glu Glu Gly Arg Asp Gln
545 550 555

Thr Thr Ser Asp Gln Ser Asn Glu Phe Gly Ile Phe Asn Ser Leu Trp
560 565 570

Phe Ser Leu Gly Ala Phe Met Gln Gln Gly Cys Asp Ile Ser Pro Arg
575 580 585 590

Ser Leu Ser Gly Arg Ile Val Gly Gly Val Trp Trp Phe Phe Thr Leu
595 600 605

Ile Ile Ile Ser Ser Tyr Thr Ala Asn Leu Ala Ala Phe Leu Thr Val
610 615 620

Glu Arg Met Val Ser Pro Ile Glu Ser Ala Glu Asp Leu Ala Asn Glu
625 630 635

Thr Glu Ile Ala Tyr Gly Thr Leu Glu Ala Gly Ser Thr Lys Glu Phe
640 645 650

Phe Arg Arg Ser Lys Ile Ala Val Phe Glu Lys Met Trp Thr Tyr Met
655 660 665 670

Lys Ser Ala Glu Pro Ser Val Phe Val Arg Thr Thr Glu Glu Gly Met
675 680 685

Ile Arg Val Arg Lys Ser Lys Gly Lys Tyr Ala Tyr Leu Leu Glu Ser
690 695 700

Thr Met Asn Glu Tyr Ile Glu Gln Arg Lys Pro Cys Asp Thr Met Lys
705 710 715

Val Gly Gly Asn Leu Asp Ser Lys Gly Tyr Gly Ile Ala Thr Pro Lys
720 725 730

Gly Ser Ala Leu Arg Gly Pro Val Asn Leu Ala Val Leu Lys Leu Ser
735 740 745 750

Glu Gln Gly Val Leu Asp Lys Leu Lys Ser Lys Trp Trp Tyr Asp Lys
755 760 765

Gly Glu Cys Gly Ser Lys Asp Ser Gly Ser Lys Asp Lys Thr Ser Ala
770 775 780

Leu Ser Leu Ser Asn Val Ala Gly Val Phe Tyr Ile Leu Ile Gly Gly
785 790 795

Leu Gly Leu Ala Met Leu Val Ala Leu Ile Glu Phe Cys Tyr Lys Ser
800 805 810

Arg Ser Glu Ser Lys Arg Met Lys Gly Phe Cys Leu Ile Pro Gln Gln
815 820 825 830

Ser Ile Asn Glu Ala Ile Arg Thr Ser Thr Leu Pro Arg Asn Ser Gly
835 840 845

Ala Gly Ala Ser Ser Gly Gly Ser Gly Glu Asn Gly Arg Val Val Ser
850 855 860

His Asp Phe Pro Lys Ser Met Gln Ser Ile Pro Cys Met Ser His Ser
865 870 875

Ser Gly Met Pro Leu Gly Ala Thr Gly Leu
880 885



(2) INFORMATION FOR SEQ ID NO: 3:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3407 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 315..2966

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 315..374

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 375..2966
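
The feature locations above can be cross-checked against the stated protein length with simple arithmetic. The sketch below is illustrative and not part of the filed listing; the helper names are hypothetical, and it assumes that the CDS and mat_peptide spans include the terminating stop codon (consistent with the TAG that closes the coding region in the sequence description).

```python
def span_length(location):
    """Length in nucleotides of an inclusive 'start..end' feature location."""
    start, end = (int(part) for part in location.split(".."))
    return end - start + 1


def residue_count(location, includes_stop):
    """Number of amino acids encoded by a feature span.

    The span must be a whole number of codons; one codon is subtracted
    when the span is assumed to end with a stop codon.
    """
    nucleotides = span_length(location)
    if nucleotides % 3 != 0:
        raise ValueError(f"{location} is not a whole number of codons")
    codons = nucleotides // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    total = residue_count("315..2966", includes_stop=True)    # CDS
    signal = residue_count("315..374", includes_stop=False)   # sig_peptide
    mature = residue_count("375..2966", includes_stop=True)   # mat_peptide
    print(total, signal, mature)
```

Under that assumption the CDS encodes 883 residues, which agrees with the length given for SEQ ID NO: 4, split into a 20-residue signal peptide and an 863-residue mature protein.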



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCCGTG AGTGCATGGG AGGGTGCTGA ATATTCCGAG ACACTGGGAC CACAGCGGCA 60 

GCTCCGCTGA AAACTGCATT CAGCCAGTCC TCCGGACTTC TGGAGCGGGG ACAGGGCGCA 120 

GGGCATCAGC AGCCACCAGC AGGACCTGCG AAATAGGGAT TCTTCTGCCT CCACTTCAGG 180 

TTTTAGCAGC TTCGTCCTAA ATTGCTGTCT CAAAATGCAG AGGATCTAAT TTGCAGAGGA 240 

AAACAGCCAA AGAAGGAAGA GGAGGAAAAG GAAAAAAAAA GGGGTATATT GTGGATGCTC 300 

TACTTTTCTT GGAA ATG CAA AAG ATT ATG CAT ATT TCT GTC CTC CTT TCT 350
Met Gln Lys Ile Met His Ile Ser Val Leu Leu Ser
-20 -15 -10

CCT GTT TTA TGG GGA CTG ATT TTT GGT GTC TCT TCT AAC AGC ATA CAG 398
Pro Val Leu Trp Gly Leu Ile Phe Gly Val Ser Ser Asn Ser Ile Gln
-5 1 5

ATA GGG GGG CTA TTT CCT AGG GGC GCC GAT CAA GAA TAC AGT GGA TTT 446
Ile Gly Gly Leu Phe Pro Arg Gly Ala Asp Gln Glu Tyr Ser Ala Phe
10 15 20

CGA GTA GGG ATG GTT CAG TTT TCC ACT TCG GAG TTC AGA CTG ACA CCC 494
Arg Val Gly Met Val Gln Phe Ser Thr Ser Glu Phe Arg Leu Thr Pro
25 30 35 40

CAC ATC GAC AAT TTG GAG GTG CCA AAC AGC TTC GGA GTC ACT AAT GCT 542
His Ile Asp Asn Leu Glu Val Ala Asn Ser Phe Ala Val Thr Asn Ala
45 50 55

TTC TGC TCC CAG TTT TCG AGA GGA GTC TAT GCT ATT TTT GGA TTT TAT 590
Phe Cys Ser Gln Phe Ser Arg Gly Val Tyr Ala Ile Phe Gly Phe Tyr
60 65 70

GAC AAG AAG TCT GTA AAT ACC ATC ACA TCA TTT TGC GGA ACA CTC CAC 638
Asp Lys Lys Ser Val Asn Thr Ile Thr Ser Phe Cys Gly Thr Leu His
75 80 85

GTC TCC TTC ATC ACT CCC AGC TTC CCA ACA GAT GGC ACA CAT CCA TTT 686
Val Ser Phe Ile Thr Pro Ser Phe Pro Thr Asp Gly Thr His Pro Phe
90 95 100

GTC ATT CAG ATG AGA CCC GAC CTC AAA GGA GCT CTC CTT AGC TTG ATT 734
Val Ile Gln Met Arg Pro Asp Leu Lys Gly Ala Leu Leu Ser Leu Ile
105 110 115 120

GAA TAC TAT CAA TGG GAC AAG TTT GCA TAC CTC TAT GAC AGT GAC AGA 782
Glu Tyr Tyr Gln Trp Asp Lys Phe Ala Tyr Leu Tyr Asp Ser Asp Arg
125 130 135

GGC TTA TCA ACA CTG CAA GCT GTG CTG GAT TCT GCT GCT GAA AAG AAA 830
Gly Leu Ser Thr Leu Gln Ala Val Leu Asp Ser Ala Ala Glu Lys Lys
140 145 150

TGG CAA GTG ACT GCT ATC AAT CTG GGA AAC ATT AAC AAT GAC AAG AAA 878
Trp Gln Val Thr Ala Ile Asn Val Gly Asn Ile Asn Asn Asp Lys Lys
155 160 165

GAT GAG ATG TAC CGA TCA CTT TTT CAA GAT CTG GAG TTA AAA AAG GAA 926
Asp Glu Met Tyr Arg Ser Leu Phe Gln Asp Leu Glu Leu Lys Lys Glu
170 175 180

CGG CGT GTA ATT CTG GAC TCT GAA AGG GAT AAA GTA AAC GAC ATT GTA 974
Arg Arg Val Ile Leu Asp Cys Glu Arg Asp Lys Val Asn Asp Ile Val
185 190 195 200

GAC CAG GTT ATT ACC ATT GGA AAA CAC GTT AAA GGG TAC CAC TAC ATC 1022
Asp Gln Val Ile Thr Ile Gly Lys His Val Lys Gly Tyr His Tyr Ile
205 210 215

ATT GCA AAT CTG GGA TTT ACT GAT GGA GAC CTA TTA AAA ATC CAG TTT 1070
Ile Ala Asn Leu Gly Phe Thr Asp Gly Asp Leu Leu Lys Ile Gln Phe
220 225 230

GGA GGT GCA AAT GTC TCT GGA TTT CAG ATA GTG GAC TAT GAT GAT TCG 1118
Gly Gly Ala Asn Val Ser Gly Phe Gln Ile Val Asp Tyr Asp Asp Ser
235 240 245

TTG GTA TCT AAA TTT ATA GAA AGA TGG TCA ACA CTG GAA GAA AAA GAA 1166
Leu Val Ser Lys Phe Ile Glu Arg Trp Ser Thr Leu Glu Glu Lys Glu
250 255 260

TAC CCT GGA GCT CAC ACA ACA ACA ATT AAG TAT ACT TCT GCT CTG ACC 1214
Tyr Pro Gly Ala His Thr Thr Thr Ile Lys Tyr Thr Ser Ala Leu Thr
265 270 275 280

TAT GAT GCC GTT CAA GTG ATG ACT GAA GCC TTC CGC AAC CTA AGG AAG 1262
Tyr Asp Ala Val Gln Val Met Thr Glu Ala Phe Arg Asn Leu Arg Lys
285 290 295

CAA AGA ATT GAA ATC TCC CGA AGG GGG AAT GCA GGA GAC TGT CTG GCA 1310
Gln Arg Ile Glu Ile Ser Arg Arg Gly Asn Ala Gly Asp Cys Leu Ala
300 305 310

AAC CCA GCA GTG CCC TGG GGA CAA GGT GTA GAA ATA GAA AGG GCC CTC 1358
Asn Pro Ala Val Pro Trp Gly Gln Gly Val Glu Ile Glu Arg Ala Leu
315 320 325

AAA CAG GTT CAG GTT GAA GGT CTC TCA GGA AAT ATA AAG TTT GAC GAG 1406
Lys Gln Val Gln Val Glu Gly Leu Ser Gly Asn Ile Lys Phe Asp Gln
330 335 340

AAT GGA AAA AGA ATA AAC TAT ACA ATT AAC ATC ATG GAG CTC AAA ACT 1454
Asn Gly Lys Arg Ile Asn Tyr Thr Ile Asn Ile Met Glu Leu Lys Thr
345 350 355 360

AAT GGG CCC CGC AAG ATT GGC TAC TGG AGT GAA GTG GAC AAA ATG GTT 1502
Asn Gly Pro Arg Lys Ile Gly Tyr Trp Ser Glu Val Asp Lys Met Val
365 370 375

GTT ACC CTT ACT GAG CTC CCT TCT GGA AAT GAC ACC TCT GGG CTT GAG 1550
Val Thr Leu Thr Glu Leu Pro Ser Gly Asn Asp Thr Ser Gly Leu Glu
380 385 390

AAT AAG ACT GTT GTT GTC ACC ACA ATT TTG GAA TCT CCG TAT GTT ATG 1598
Asn Lys Thr Val Val Val Thr Thr Ile Leu Glu Ser Pro Tyr Val Met
395 400 405

ATG AAG AAA AAT CAT GAA ATG CTT GAA GGC AAT GAG CGC TAT GAG GGC 1646
Met Lys Lys Asn His Glu Met Leu Glu Gly Asn Glu Arg Tyr Glu Gly
410 415 420

TAC TGT GTT GAC CTG GCT GCA GAA ATC GCC AAA CAT TGT GGG TTC AAG 1694
Tyr Cys Val Asp Leu Ala Ala Glu Ile Ala Lys His Cys Gly Phe Lys
425 430 435 440

TAC AAG TTG ACA ATT GTT GGT GAT GGC AAG TAT GGG GCC AGG GAT GCA 1742
Tyr Lys Leu Thr Ile Val Gly Asp Gly Lys Tyr Gly Ala Arg Asp Ala
445 450 455

GAC ACG AAA ATT TGG AAT GGG ATG GTT GGA GAA CTT GTA TAT GGG AAA 1790
Asp Thr Lys Ile Trp Asn Gly Met Val Gly Glu Leu Val Tyr Gly Lys
460 465 470

GCT GAT ATT GCA ATT GCT CCA TTA ACT ATT ACC CTT GTG AGA GAA GAG 1838
Ala Asp Ile Ala Ile Ala Pro Leu Thr Ile Thr Leu Val Arg Glu Glu
475 480 485

GTG ATT GAC TTC TCA AAG CCC TTC ATG AGC CTC GGG ATA TCT ATC ATG 1886
Val Ile Asp Phe Ser Lys Pro Phe Met Ser Leu Gly Ile Ser Ile Met
490 495 500

ATC AAG AAG CCT CAG AAG TCC AAA CCA GGA GTG TTT TCC TTT CTT GAT 1934
Ile Lys Lys Pro Gln Lys Ser Lys Pro Gly Val Phe Ser Phe Leu Asp
505 510 515 520

CCT TTA GCC TAT GAG ATC TGG ATG TGC ATT GTT TTT GCC TAC ATT GGG 1982
Pro Leu Ala Tyr Glu Ile Trp Met Cys Ile Val Phe Ala Tyr Ile Gly
525 530 535

CTC AGT GTA GTT TTA TTC CTC GTC AGC AGA TTT AGC CCC TAC GAG TGG 2030
Val Ser Val Val Leu Phe Leu Val Ser Arg Phe Ser Pro Tyr Glu Trp
540 545 550

CAC ACT GAG GAG TTT GAA GAT GGA AGA CAA ACA CAA AGT AGT CAA TCA 2078
His Thr Glu Glu Phe Glu Asp Gly Arg Glu Thr Gln Ser Ser Glu Ser
555 560 565

ACT AAT GAA TTT GGG ATT TTT AAT AGT CTC TGG TTT TCC TTG GGT GCC 2126
Thr Asn Glu Phe Gly Ile Phe Asn Ser Leu Trp Phe Ser Leu Gly Ala
570 575 580

TTT ATG CGG CAA GGA TGC GAT ATT TCG CCA AGA TCC CTC TCT GGG CGC 2174
Phe Met Arg Gln Gly Cys Asp Ile Ser Pro Arg Ser Leu Ser Gly Arg
585 590 595 600

ATT GTT GGA GGT GTG TGG TGG TTC TTT ACC CTG ATC ATA ATC TCC TCC 2222
Ile Val Gly Gly Val Trp Trp Phe Phe Thr Leu Ile Ile Ile Ser Ser
605 610 615

TAC ACG GCT AAC TTA CCT GCC TTC CTG ACT GTA GAG AGG ATG GTG TCT 2270
Tyr Thr Ala Asn Leu Ala Ala Phe Leu Thr Val Glu Arg Met Val Ser
620 625 630

CCC ATC GAA AGT CCT GAG GAT CTT TCT AAG CAA ACA GAA ATT GCT TAT 2318
Pro Ile Glu Ser Ala Glu Asp Leu Ser Lys Gln Thr Glu Ile Ala Tyr
635 640 645

GGA ACA TTA GAC TCT GGC TCC ACT AAA GAG TTT TTC AGG AGA TCT AAA 2366
Gly Thr Leu Asp Ser Gly Ser Thr Lys Glu Phe Phe Arg Arg Ser Lys
650 655 660

ATT GCA GTG TTT CAT AAA ATG TGG ACC TAC ATG CGC AGT CCC GAG CCC 2414
Ile Ala Val Phe Asp Lys Met Trp Thr Tyr Met Arg Ser Ala Glu Pro
665 670 675 680

TCT GTG TTT GTG AGG ACT ACG GCC GAA GGG GTG GCT AGA GTG CGG AAG 2462
Ser Val Phe Val Arg Thr Thr Ala Glu Gly Val Ala Arg Val Arg Lys
685 690 695

TCC AAA GGG AAA TAT GCC TAC TTG TTG GAG TCC ACG ATG AAC GAG TAC 2510
Ser Lys Gly Lys Tyr Ala Tyr Leu Leu Glu Ser Thr Met Asn Glu Tyr
700 705 710

ATT GAG CAA AGG AAG CCT TGC GAC ACC ATG AAA GTT GGT GGA AAC CTG 2558
Ile Glu Gln Arg Lys Pro Cys Asp Thr Met Lys Val Gly Gly Asn Leu
715 720 725

GAT TCC AAA GGC TAT GGC ATC GCA ACA CCT AAA GGA TCC TCA TTA GGA 2606
Asp Ser Lys Gly Tyr Gly Ile Ala Thr Pro Lys Gly Ser Ser Leu Gly
730 735 740

ACC CCA GTA AAT CTT GCA GTA TTG AAA CTC AGT GAG CAA GGC GTC TTA 2654
Thr Pro Val Asn Leu Ala Val Leu Lys Leu Ser Glu Gln Gly Val Leu
745 750 755 760

GAC AAG CTG AAA AAC AAA TGG TGG TAC GAT AAA GGT GAA TGT GGA GCC 2702
Asp Lys Leu Lys Asn Lys Trp Trp Tyr Asp Lys Gly Glu Cys Gly Ala
765 770 775

AAG GAC TCT GGA AGT AAG GAA AAG ACC AGT GCC CTC AGT CTG ACC AAC 2750
Lys Asp Ser Gly Ser Lys Glu Lys Thr Ser Ala Leu Ser Leu Ser Asn
780 785 790

GTT GCT GGA GTA TTC TAC ATC CTT GTC GGG GGC CTT GGT TTG GCA ATG 2798
Val Ala Gly Val Phe Tyr Ile Leu Val Gly Gly Leu Gly Leu Ala Met
795 800 805

CTG GTG GCT TTG ATT GAG TTC TGT TAC AAG TCA AGG GCC GAG GCG AAA 2846
Leu Val Ala Leu Ile Glu Phe Cys Tyr Lys Ser Arg Ala Glu Ala Lys
810 815 820

CGA ATG AAG GTG GCA AAG AAT GCA CAG AAT ATT AAC CCA TCT TCC TCG 2894
Arg Met Lys Val Ala Lys Asn Ala Gln Asn Ile Asn Pro Ser Ser Ser
825 830 835 840

CAG AAT TCA CAG AAT TTT GCA ACT TAT AAG GAA GGT TAC AAC GTA TAT 2942
Gln Asn Ser Gln Asn Phe Ala Thr Tyr Lys Glu Gly Tyr Asn Val Tyr
845 850 855

GGC ATC GAA AGT GTT AAA ATT TAGGGGATGA CCTTGAATGA TGCCATGAGG 2993
Gly Ile Glu Ser Val Lys Ile
860



AACAAGGCAA GCCTGTCAAT TACAGGAAGT ACTGGAGAAA ATGGACGTGT TATGACTCCA 3053 

GAATTTCCCA AAGCNGTGCA TGCTGTCCCT TACGTGAGTC CTGGCATGGG AATGAATGTC 3113 

AGTGTGACTG ATCTCTCGTG ATTGATAAGA ACCTTTTGAG TGCCTTACAC AATGGTTTTC 3173

TTGTGTGTTT ATTGTCAAAG TGGTGAGAGG CATCCAGTAT CTTGAAGACT TTTCTTTCAG 3233

CCAAGAATTC TTAAATATGT GGAGTTCATC TTGAATTGTA AGGAATGATT AATTAAAACA 3293

CAACATCTTT TTCTACTCGA GTTACAGACA AAGCGTGGTG GACATGCACA GCTAACATGG 3353

AAGTACTATA ATTTACCTGA AGTCTTTGTA CAGACAACAA ACCTGTTTCT GCAG 3407



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 883 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gln Lys Ile Met His Ile Ser Val Leu Leu Ser Pro Val Leu Trp
-20 -15 -10 -5

Gly Leu Ile Phe Gly Val Ser Ser Asn Ser Ile Gln Ile Gly Gly Leu
1 5 10

Phe Pro Arg Gly Ala Asp Gln Glu Tyr Ser Ala Phe Arg Val Gly Met
15 20 25

Val Gln Phe Ser Thr Ser Glu Phe Arg Leu Thr Pro His Ile Asp Asn
30 35 40

Leu Glu Val Ala Asn Ser Phe Ala Val Thr Asn Ala Phe Cys Ser Gln
45 50 55 60

Phe Ser Arg Gly Val Tyr Ala Ile Phe Gly Phe Tyr Asp Lys Lys Ser
65 70 75

Val Asn Thr Ile Thr Ser Phe Cys Gly Thr Leu His Val Ser Phe Ile
80 85 90

Thr Pro Ser Phe Pro Thr Asp Gly Thr His Pro Phe Val Ile Gln Met
95 100 105

Arg Pro Asp Leu Lys Gly Ala Leu Leu Ser Leu Ile Glu Tyr Tyr Gln
110 115 120

Trp Asp Lys Phe Ala Tyr Leu Tyr Asp Ser Asp Arg Gly Leu Ser Thr
125 130 135 140

Leu Gln Ala Val Leu Asp Ser Ala Ala Glu Lys Lys Trp Gln Val Thr
145 150 155

Ala Ile Asn Val Gly Asn Ile Asn Asn Asp Lys Lys Asp Glu Met Tyr
160 165 170

Arg Ser Leu Phe Gln Asp Leu Glu Leu Lys Lys Glu Arg Arg Val Ile
175 180 185

Leu Asp Cys Glu Arg Asp Lys Val Asn Asp Ile Val Asp Gln Val Ile
190 195 200

Thr Ile Gly Lys His Val Lys Gly Tyr His Tyr Ile Ile Ala Asn Leu
205 210 215 220

Gly Phe Thr Asp Gly Asp Leu Leu Lys Ile Gln Phe Gly Gly Ala Asn
225 230 235

Val Ser Gly Phe Gln Ile Val Asp Tyr Asp Asp Ser Leu Val Ser Lys
240 245 250

Phe Ile Glu Arg Trp Ser Thr Leu Glu Glu Lys Glu Tyr Pro Gly Ala
255 260 265

His Thr Thr Thr Ile Lys Tyr Thr Ser Ala Leu Thr Tyr Asp Ala Val
270 275 280

Gln Val Met Thr Glu Ala Phe Arg Asn Leu Arg Lys Gln Arg Ile Glu
285 290 295 300

Ile Ser Arg Arg Gly Asn Ala Gly Asp Cys Leu Ala Asn Pro Ala Val
305 310 315

Pro Trp Gly Gln Gly Val Glu Ile Glu Arg Ala Leu Lys Gln Val Gln
320 325 330

Val Glu Gly Leu Ser Gly Asn Ile Lys Phe Asp Gln Asn Gly Lys Arg
335 340 345

Ile Asn Tyr Thr Ile Asn Ile Met Glu Leu Lys Thr Asn Gly Pro Arg
350 355 360

Lys Ile Gly Tyr Trp Ser Glu Val Asp Lys Met Val Val Thr Leu Thr
365 370 375 380

Glu Leu Pro Ser Gly Asn Asp Thr Ser Gly Leu Glu Asn Lys Thr Val
385 390 395

Val Val Thr Thr Ile Leu Glu Ser Pro Tyr Val Met Met Lys Lys Asn
400 405 410

His Glu Met Leu Glu Gly Asn Glu Arg Tyr Glu Gly Tyr Cys Val Asp
415 420 425

Leu Ala Ala Glu Ile Ala Lys His Cys Gly Phe Lys Tyr Lys Leu Thr
430 435 440

Ile Val Gly Asp Gly Lys Tyr Gly Ala Arg Asp Ala Asp Thr Lys Ile
445 450 455 460

Trp Asn Gly Met Val Gly Glu Leu Val Tyr Gly Lys Ala Asp Ile Ala
465 470 475

Ile Ala Pro Leu Thr Ile Thr Leu Val Arg Glu Glu Val Ile Asp Phe
480 485 490

Ser Lys Pro Phe Met Ser Leu Gly Ile Ser Ile Met Ile Lys Lys Pro
495 500 505

Gln Lys Ser Lys Pro Gly Val Phe Ser Phe Leu Asp Pro Leu Ala Tyr
510 515 520

Glu Ile Trp Met Cys Ile Val Phe Ala Tyr Ile Gly Val Ser Val Val
525 530 535 540

Leu Phe Leu Val Ser Arg Phe Ser Pro Tyr Glu Trp His Thr Glu Glu
545 550 555

Phe Glu Asp Gly Arg Glu Thr Gln Ser Ser Glu Ser Thr Asn Glu Phe
560 565 570

Gly Ile Phe Asn Ser Leu Trp Phe Ser Leu Gly Ala Phe Met Arg Gln
575 580 585

Gly Cys Asp Ile Ser Pro Arg Ser Leu Ser Gly Arg Ile Val Gly Gly
590 595 600

Val Trp Trp Phe Phe Thr Leu Ile Ile Ile Ser Ser Tyr Thr Ala Asn
605 610 615 620

Leu Ala Ala Phe Leu Thr Val Glu Arg Met Val Ser Pro Ile Glu Ser
625 630 635

Ala Glu Asp Leu Ser Lys Gln Thr Glu Ile Ala Tyr Gly Thr Leu Asp
640 645 650

Ser Gly Ser Thr Lys Glu Phe Phe Arg Arg Ser Lys Ile Ala Val Phe
655 660 665

Asp Lys Met Trp Thr Tyr Met Arg Ser Ala Glu Pro Ser Val Phe Val
670 675 680

Arg Thr Thr Ala Glu Gly Val Ala Arg Val Arg Lys Ser Lys Gly Lys
685 690 695 700

Tyr Ala Tyr Leu Leu Glu Ser Thr Met Asn Glu Tyr Ile Glu Gln Arg
705 710 715

Lys Pro Cys Asp Thr Met Lys Val Gly Gly Asn Leu Asp Ser Lys Gly
720 725 730

Tyr Gly Ile Ala Thr Pro Lys Gly Ser Ser Leu Gly Thr Pro Val Asn
735 740 745

Leu Ala Val Leu Lys Leu Ser Glu Gln Gly Val Leu Asp Lys Leu Lys
750 755 760

Asn Lys Trp Trp Tyr Asp Lys Gly Glu Cys Gly Ala Lys Asp Ser Gly
765 770 775 780

Ser Lys Glu Lys Thr Ser Ala Leu Ser Leu Ser Asn Val Ala Gly Val
785 790 795

Phe Tyr Ile Leu Val Gly Gly Leu Gly Leu Ala Met Leu Val Ala Leu
800 805 810

Ile Glu Phe Cys Tyr Lys Ser Arg Ala Glu Ala Lys Arg Met Lys Val
815 820 825

Ala LVS AST! Al A Gin Ann Tla &an Dr<> Ca* o » 

- " — ~ ««4. /VOil iJCX- VjfJ.Il 

830 835 840

Asn Phe Ala Thr Tyr Lys Glu Gly Tyr Asn Val Tyr Gly Ile Glu Ser
850 855 860

Val Lys Ile



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2761 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 79..144

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 145..2745

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 79..2745

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAATTCCTGA CGACTCCTCA GTTGCGCCCA TGCTCTTGTC AGCTTCGTTT TAGGCGTAGC 60

ATGGCCAGGC AGAAGAAA ATG GGG CAA AGC GTG CTC CGG GCG GTC TTC TTT 111
Met Gly Gln Ser Val Leu Arg Ala Val Phe Phe
-22 -20 -15

TTA GTC CTG GGG CTT TTG GGT CAT TCT CAC GGA GGA TTC CCC AAC ACC 159
Leu Val Leu Gly Leu Leu Gly His Ser His Gly Gly Phe Pro Asn Thr
-10 -5 1 5

ATC AGC ATA GGT GGA CTT TTC ATG AGA AAC ACA GTG CAG GAG CAC AGC 207
Ile Ser Ile Gly Gly Leu Phe Met Arg Asn Thr Val Gln Glu His Ser
10 15 20

GCT TTC CGC TTT GCC GTG CAG TTA TAC AAC ACC AAC CAG AAC ACC ACC 255
Ala Phe Arg Phe Ala Val Gln Leu Tyr Asn Thr Asn Gln Asn Thr Thr
25 30 35

GAG AAG CCC TTC CAT TTG AAT TAC CAC GTA GAT CAC TTG GAT TCC TCC 303
Glu Lys Pro Phe His Leu Asn Tyr His Val Asp His Leu Asp Ser Ser
40 45 50

AAT AGT TTT TCC GTG ACA AAT GCT TTC TGC TCC CAG TTC TCC AGA GGG 351
Asn Ser Phe Ser Val Thr Asn Ala Phe Cys Ser Gln Phe Ser Arg Gly

55 60 65

GTG TAT GCC ATC TTT GGA TTC TAT GAC CAG ATG TCA ATG AAC ACC CTG 399
Val Tyr Ala Ile Phe Gly Phe Tyr Asp Gln Met Ser Met Asn Thr Leu
70 75 80 85

ACC TCC TTC TGT GGG GCC CTG CAC ACA TCC TTT GTT ACG CCT AGC TTC 447
Thr Ser Phe Cys Gly Ala Leu His Thr Ser Phe Val Thr Pro Ser Phe
90 95 100

CCC ACT GAC GCA GAT GTG CAG TTT GTC ATC CAG ATG CGC CCA GCC TTG 495
Pro Thr Asp Ala Asp Val Gln Phe Val Ile Gln Met Arg Pro Ala Leu
105 110 115

AAG GGC GCT ATT CTG AGT CTT CTG GGT CAT TAC AAG TGG GAG AAG TTT 543
Lys Gly Ala Ile Leu Ser Leu Leu Gly His Tyr Lys Trp Glu Lys Phe
120 125 130

GTG TAC CTC TAT GAC ACA GAA CGA GGA TTT TCC ATC CTC CAA GCG ATT 591
Val Tyr Leu Tyr Asp Thr Glu Arg Gly Phe Ser Ile Leu Gln Ala Ile
135 140 145

ATG GAA GCA GCA GTC CAA AAC AAC TGG CAA GTA ACA GCA AGG TCT GTG 639
Met Glu Ala Ala Val Gln Asn Asn Trp Gln Val Thr Ala Arg Ser Val
150 155 160 165

GGA AAC ATA AAG GAC GTC CAA GAA TTC AGG CGC ATC ATT GAA GAA ATG 687
Gly Asn Ile Lys Asp Val Gln Glu Phe Arg Arg Ile Ile Glu Glu Met
170 175 180

GAC AGG AGG CAG GAA AAG CGA TAC TTG ATT GAC TGC GAA GTC GAA AGG 735
Asp Arg Arg Gln Glu Lys Arg Tyr Leu Ile Asp Cys Glu Val Glu Arg
185 190 195

ATT AAC ACA ATT TTG GAA CAG GTT GTG ATC CTA GGG AAA CAC TCA AGA 783
Ile Asn Thr Ile Leu Glu Gln Val Val Ile Leu Gly Lys His Ser Arg
200 205 210

GGT TAT CAC TAC ATG CTC GCT AAC CTG GGT TTT ACT GAT ATT TTA CTG 831
Gly Tyr His Tyr Met Leu Ala Asn Leu Gly Phe Thr Asp Ile Leu Leu
215 220 225

GAA AGA GTC ATG CAT GGG GGA GCC AAC ATT ACA GGT TTC CAG ATT GTC 879
Glu Arg Val Met His Gly Gly Ala Asn Ile Thr Gly Phe Gln Ile Val
230 235 240 245

AAC AAT GAA AAC CCT ATG GTT CAG CAG TTC ATA CAG CGC TGG GTG AGG 927
Asn Asn Glu Asn Pro Met Val Gln Gln Phe Ile Gln Arg Trp Val Arg
250 255 260

CTG GAT GAA AGG GAA TTC CCT GAA GCC AAG AAT GCA CCA CTA AAG TAT 975
Leu Asp Glu Arg Glu Phe Pro Glu Ala Lys Asn Ala Pro Leu Lys Tyr
265 270 275

ACA TCT GCA TTG ACA CAC GAC GCA ATA CTG GTC ATA GCA GAA GCT TTC 1023
Thr Ser Ala Leu Thr His Asp Ala Ile Leu Val Ile Ala Glu Ala Phe
280 285 290

CGC TAC CTG AGG AGG CAG CGA GTA GAT GTG TCC CGG AGA GGA AGT GCT 1071
Arg Tyr Leu Arg Arg Gln Arg Val Asp Val Ser Arg Arg Gly Ser Ala
295 300 305

GGA GAC TGC TTA GCA AAT CCT GCT GTG CCC TGG AGT CAA GGA ATT GAT 1119
Gly Asp Cys Leu Ala Asn Pro Ala Val Pro Trp Ser Gln Gly Ile Asp

310 315 320 325

ATT GAG AGA GCT CTG AAA ATG GTG CAA GTA CAA GGA ATG ACT GGA AAT 1167
Ile Glu Arg Ala Leu Lys Met Val Gln Val Gln Gly Met Thr Gly Asn
330 335 340

ATT CAA TTT GAC ACT TAT GGA CGT AGG ACA AAT TAT ACC ATC GAT GTG 1215
Ile Gln Phe Asp Thr Tyr Gly Arg Arg Thr Asn Tyr Thr Ile Asp Val
345 350 355

TAT GAA ATG AAA GTC AGT GGC TCT CGA AAA GCT GGC TAC TGG AAC GAG 1263
Tyr Glu Met Lys Val Ser Gly Ser Arg Lys Ala Gly Tyr Trp Asn Glu
360 365 370

TAT GAA AGG TTT GTG CCT TTC TCA GAT CAG CAA ATC AGC AAT GAC AGT 1311
Tyr Glu Arg Phe Val Pro Phe Ser Asp Gln Gln Ile Ser Asn Asp Ser
375 380 385

GCA TCC TCA GAG AAT CGG ACC ATA GTA GTG ACT ACC ATT CTG GAA TCA 1359
Ala Ser Ser Glu Asn Arg Thr Ile Val Val Thr Thr Ile Leu Glu Ser
390 395 400 405

CCA TAT GTA ATG TAC AAG AAG AAC CAT GAG CAA CTG GAA GGA AAT GAA 1407
Pro Tyr Val Met Tyr Lys Lys Asn His Glu Gln Leu Glu Gly Asn Glu
410 415 420

CGA TAT GAA GGC TAT TGT GTA GAC CTA GCC TAT GAA ATA GCC AAA CAT 1455
Arg Tyr Glu Gly Tyr Cys Val Asp Leu Ala Tyr Glu Ile Ala Lys His
425 430 435

GTA AGG ATC AAA TAC AAA TTG TCC ATC GTT GGT GAC GGG AAA TAT GGT 1503
Val Arg Ile Lys Tyr Lys Leu Ser Ile Val Gly Asp Gly Lys Tyr Gly
440 445 450

GCA AGG GAT CCA GAG ACT AAA ATA TGG AAC GGC ATG GTT GGG GAA CTT 1551
Ala Arg Asp Pro Glu Thr Lys Ile Trp Asn Gly Met Val Gly Glu Leu
455 460 465

GTC TAT GGG AGA GCT GAT ATA GCT GTT GCT CCA CTC ACT ATA ACA TTG 1599
Val Tyr Gly Arg Ala Asp Ile Ala Val Ala Pro Leu Thr Ile Thr Leu
470 475 480 485

GTC CGT GAA GAA GTC ATA GAT TTT TCA AAG CCA TTA ATG AGC CTG GGC 1647
Val Arg Glu Glu Val Ile Asp Phe Ser Lys Pro Leu Met Ser Leu Gly
490 495 500

ATC TCC ATC ATG ATA AAG AAG CCT CAG AAA TCA AAA CCA GGC GTA TTC 1695
Ile Ser Ile Met Ile Lys Lys Pro Gln Lys Ser Lys Pro Gly Val Phe
505 510 515

TCA TTT CTG GAT CCC CTG GCT TAT GAA ATC TGG ATG TGC ATT GTC TTT 1743
Ser Phe Leu Asp Pro Leu Ala Tyr Glu Ile Trp Met Cys Ile Val Phe
520 525 530

GCT TAC ATT GGA GTC AGC GTA GTT CTT TTC CTA GTC AGC AGG TTC AGT 1791
Ala Tyr Ile Gly Val Ser Val Val Leu Phe Leu Val Ser Arg Phe Ser
535 540 545

CCT TAT GAA TGG CAC TTG GAA GAC AAC AAT GAA GAA CCT CGT GAC CCA 1839
Pro Tyr Glu Trp His Leu Glu Asp Asn Asn Glu Glu Pro Arg Asp Pro
550 555 560 565

CAA AGT CCT CCT GAT CCT CCA AAT GAA TTT GGA ATA TTT AAC AGT CTT 1887
Gln Ser Pro Pro Asp Pro Pro Asn Glu Phe Gly Ile Phe Asn Ser Leu
570 575 580

TGG TTT TCC TTG GGT GCC TTT ATG CAG CAA GGA TGT GAT ATT TCT CCA 1935
Trp Phe Ser Leu Gly Ala Phe Met Gln Gln Gly Cys Asp Ile Ser Pro
585 590 595

AGA TCA CTC TCC GGG CGC ATT GTT GGA GGG GTT TGG TGG TTC TTC ACC 1983
Arg Ser Leu Ser Gly Arg Ile Val Gly Gly Val Trp Trp Phe Phe Thr
600 605 610

CTG ATC ATA ATT TCT TCC TAT ACT GCC AAT CTC GCT GCT TTC CTG ACT 2031
Leu Ile Ile Ile Ser Ser Tyr Thr Ala Asn Leu Ala Ala Phe Leu Thr
615 620 625

GTG GAG AGG ATG GTT TCT CCC ATA GAG AGT GCT GAA GAC TTA GCT AAA 2079
Val Glu Arg Met Val Ser Pro Ile Glu Ser Ala Glu Asp Leu Ala Lys
630 635 640 645

CAG ACT GAA ATT GCA TAT GGC ACC CTG GAC TCC GGT TCA ACA AAA GAA 2127
Gln Thr Glu Ile Ala Tyr Gly Thr Leu Asp Ser Gly Ser Thr Lys Glu
650 655 660

TTT TTC AGA AGA TCC AAA ATT GCT GTG TAC GAG AAA ATG TGG TCT TAC 2175
Phe Phe Arg Arg Ser Lys Ile Ala Val Tyr Glu Lys Met Trp Ser Tyr
665 670 675

ATG AAA TCA GCG GAG CCA TCT GTG TTT ACC AAA ACA ACA GCA GAC GGA 2223
Met Lys Ser Ala Glu Pro Ser Val Phe Thr Lys Thr Thr Ala Asp Gly
680 685 690

GTG GCC CGA GTG CGA AAG TCC AAG GGA AAG TTC GCC TTC CTG CTG GAG 2271
Val Ala Arg Val Arg Lys Ser Lys Gly Lys Phe Ala Phe Leu Leu Glu
695 700 705

TCA ACC ATG AAT GAG TAC ATT GAG CAG AGA AAA CCA TGT GAT ACG ATG 2319
Ser Thr Met Asn Glu Tyr Ile Glu Gln Arg Lys Pro Cys Asp Thr Met
710 715 720 725

AAA GTT GGT GGA AAT CTG GAT TCC AAA GGC TAT GGT GTG GCA ACC CCT 2367
Lys Val Gly Gly Asn Leu Asp Ser Lys Gly Tyr Gly Val Ala Thr Pro
730 735 740

AAA GGC TCA GCA TTA GGA AAT GCT GTT AAC CTG GCA GTA TTA AAA CTG 2415
Lys Gly Ser Ala Leu Gly Asn Ala Val Asn Leu Ala Val Leu Lys Leu
745 750 755

AAT GAG CAA GGC CTC TTG GAC AAA TTG AAA AAC AAA TGG TGG TAC GAC 2463
Asn Glu Gln Gly Leu Leu Asp Lys Leu Lys Asn Lys Trp Trp Tyr Asp
760 765 770

AAA GGA GAG TGC GGC AGC GGG GGC GGT GAC TCC AAG GAC AAG ACC AGC 2511
Lys Gly Glu Cys Gly Ser Gly Gly Gly Asp Ser Lys Asp Lys Thr Ser
775 780 785

GCT CTG AGC CTG AGC AAT GTG GCA GGC GTT TTC TAT ATA CTT GTC GGA 2559
Ala Leu Ser Leu Ser Asn Val Ala Gly Val Phe Tyr Ile Leu Val Gly
790 795 800 805

GGT CTG GGG CTG GCC ATG ATG GTG GCT TTG ATA GAA TTC TGT TAC AAA 2607
Gly Leu Gly Leu Ala Met Met Val Ala Leu Ile Glu Phe Cys Tyr Lys
810 815 820

TCA CGG GCA GAG TCC AAA CGC ATG AAA CTC ACA AAG AAC ACC CAA AAC 2655
Ser Arg Ala Glu Ser Lys Arg Met Lys Leu Thr Lys Asn Thr Gln Asn
825 830 835

TTT AAG CCT GCT CCT GCC ACC AAC ACT CAG AAT TAT GCT ACA TAC AGA 2703
Phe Lys Pro Ala Pro Ala Thr Asn Thr Gln Asn Tyr Ala Thr Tyr Arg
840 845 850

GAA GGC TAC AAC GTG TAT GGA ACA GAG AGT GTT AAG ATC TAGGGATCCC 2752
Glu Gly Tyr Asn Val Tyr Gly Thr Glu Ser Val Lys Ile
855 860 865

TTGGAATTC 2761
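
The feature coordinates listed for SEQ ID NO: 5 can be checked against the residue numbering with a little arithmetic. The following Python sketch is illustrative only and is not part of the application as filed. It assumes that the coordinates are 1-based and inclusive, and that the end position 2745 includes the TAG stop codon that follows the final Ile codon in the listing. The names `FEATURES`, `codon_count` and `residue_counts` are hypothetical.

```python
# Feature coordinates as given for SEQ ID NO: 5 (assumed 1-based, inclusive).
FEATURES = {
    "sig_peptide": (79, 144),
    "mat_peptide": (145, 2745),
    "CDS": (79, 2745),
}


def codon_count(start, end):
    """Number of whole codons in an inclusive nucleotide span."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"span {start}..{end} is not a multiple of 3")
    return length // 3


def residue_counts(features, stop_included=True):
    """Residue counts implied by the feature table.

    Assumption: when stop_included is True, the last codon of the
    mat_peptide and CDS spans is the stop codon and encodes no residue.
    """
    stop = 1 if stop_included else 0
    return {
        "signal": codon_count(*features["sig_peptide"]),
        "mature": codon_count(*features["mat_peptide"]) - stop,
        "total": codon_count(*features["CDS"]) - stop,
    }


counts = residue_counts(FEATURES)
print(counts)
```

Under these assumptions the spans correspond to 22 signal residues and 866 mature residues, 888 in total, which agrees with the residue numbering of -22 to 866 used in the listing.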



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 888 amino acids

(B) TYPE: amino acid

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gly Gln Ser Val Leu Arg Ala Val Phe Phe Leu Val Leu Gly Leu
-22 -20 -15 -10

Leu Gly His Ser His Gly Gly Phe Pro Asn Thr Ile Ser Ile Gly Gly
-5 1 5 10

Leu Phe Met Arg Asn Thr Val Gln Glu His Ser Ala Phe Arg Phe Ala
15 20 25

Val Gln Leu Tyr Asn Thr Asn Gln Asn Thr Thr Glu Lys Pro Phe His
30 35 40

Leu Asn Tyr His Val Asp His Leu Asp Ser Ser Asn Ser Phe Ser Val
45 50 55

Thr Asn Ala Phe Cys Ser Gln Phe Ser Arg Gly Val Tyr Ala Ile Phe
60 65 70

Gly Phe Tyr Asp Gln Met Ser Met Asn Thr Leu Thr Ser Phe Cys Gly
75 80 85 90

Ala Leu His Thr Ser Phe Val Thr Pro Ser Phe Pro Thr Asp Ala Asp
95 100 105

Val Gln Phe Val Ile Gln Met Arg Pro Ala Leu Lys Gly Ala Ile Leu
110 115 120

Ser Leu Leu Gly His Tyr Lys Trp Glu Lys Phe Val Tyr Leu Tyr Asp
125 130 135

Thr Glu Arg Gly Phe Ser Ile Leu Gln Ala Ile Met Glu Ala Ala Val
140 145 150

Gln Asn Asn Trp Gln Val Thr Ala Arg Ser Val Gly Asn Ile Lys Asp
155 160 165 170

Val Gln Glu Phe Arg Arg Ile Ile Glu Glu Met Asp Arg Arg Gln Glu
175 180 185

Lys Arg Tyr Leu Ile Asp Cys Glu Val Glu Arg Ile Asn Thr Ile Leu
190 195 200

Glu Gln Val Val Ile Leu Gly Lys His Ser Arg Gly Tyr His Tyr Met
205 210 215

Leu Ala Asn Leu Gly Phe Thr Asp Ile Leu Leu Glu Arg Val Met His
220 225 230

Gly Gly Ala Asn Ile Thr Gly Phe Gln Ile Val Asn Asn Glu Asn Pro
235 240 245 250

Met Val Gln Gln Phe Ile Gln Arg Trp Val Arg Leu Asp Glu Arg Glu
255 260 265

Phe Pro Glu Ala Lys Asn Ala Pro Leu Lys Tyr Thr Ser Ala Leu Thr
270 275 280

His Asp Ala Ile Leu Val Ile Ala Glu Ala Phe Arg Tyr Leu Arg Arg
285 290 295

Gln Arg Val Asp Val Ser Arg Arg Gly Ser Ala Gly Asp Cys Leu Ala
300 305 310

Asn Pro Ala Val Pro Trp Ser Gln Gly Ile Asp Ile Glu Arg Ala Leu
315 320 325 330

Lys Met Val Gln Val Gln Gly Met Thr Gly Asn Ile Gln Phe Asp Thr
335 340 345

Tyr Gly Arg Arg Thr Asn Tyr Thr Ile Asp Val Tyr Glu Met Lys Val
350 355 360

Ser Gly Ser Arg Lys Ala Gly Tyr Trp Asn Glu Tyr Glu Arg Phe Val
365 370 375

Pro Phe Ser Asp Gln Gln Ile Ser Asn Asp Ser Ala Ser Ser Glu Asn
380 385 390

Arg Thr Ile Val Val Thr Thr Ile Leu Glu Ser Pro Tyr Val Met Tyr
395 400 405 410

Lys Lys Asn His Glu Gln Leu Glu Gly Asn Glu Arg Tyr Glu Gly Tyr
415 420 425

Cys Val Asp Leu Ala Tyr Glu Ile Ala Lys His Val Arg Ile Lys Tyr
430 435 440

Lys Leu Ser Ile Val Gly Asp Gly Lys Tyr Gly Ala Arg Asp Pro Glu
445 450 455

Thr Lys Ile Trp Asn Gly Met Val Gly Glu Leu Val Tyr Gly Arg Ala
460 465 470

Asp Ile Ala Val Ala Pro Leu Thr Ile Thr Leu Val Arg Glu Glu Val
475 480 485 490

Ile Asp Phe Ser Lys Pro Leu Met Ser Leu Gly Ile Ser Ile Met Ile
495 500 505

Lys Lys Pro Gln Lys Ser Lys Pro Gly Val Phe Ser Phe Leu Asp Pro
510 515 520

Leu Ala Tyr Glu Ile Trp Met Cys Ile Val Phe Ala Tyr Ile Gly Val
525 530 535

Ser Val Val Leu Phe Leu Val Ser Arg Phe Ser Pro Tyr Glu Trp His
540 545 550

Leu Glu Asp Asn Asn Glu Glu Pro Arg Asp Pro Gln Ser Pro Pro Asp
555 560 565 570

Pro Pro Asn Glu Phe Gly Ile Phe Asn Ser Leu Trp Phe Ser Leu Gly
575 580 585

Ala Phe Met Gln Gln Gly Cys Asp Ile Ser Pro Arg Ser Leu Ser Gly
590 595 600

Arg Ile Val Gly Gly Val Trp Trp Phe Phe Thr Leu Ile Ile Ile Ser
605 610 615

Ser Tyr Thr Ala Asn Leu Ala Ala Phe Leu Thr Val Glu Arg Met Val
620 625 630

Ser Pro Ile Glu Ser Ala Glu Asp Leu Ala Lys Gln Thr Glu Ile Ala
635 640 645 650

Tyr Gly Thr Leu Asp Ser Gly Ser Thr Lys Glu Phe Phe Arg Arg Ser
655 660 665

Lys Ile Ala Val Tyr Glu Lys Met Trp Ser Tyr Met Lys Ser Ala Glu
670 675 680

Pro Ser Val Phe Thr Lys Thr Thr Ala Asp Gly Val Ala Arg Val Arg
685 690 695

Lys Ser Lys Gly Lys Phe Ala Phe Leu Leu Glu Ser Thr Met Asn Glu
700 705 710

Tyr Ile Glu Gln Arg Lys Pro Cys Asp Thr Met Lys Val Gly Gly Asn
715 720 725 730

Leu Asp Ser Lys Gly Tyr Gly Val Ala Thr Pro Lys Gly Ser Ala Leu
735 740 745

Gly Asn Ala Val Asn Leu Ala Val Leu Lys Leu Asn Glu Gln Gly Leu
750 755 760

Leu Asp Lys Leu Lys Asn Lys Trp Trp Tyr Asp Lys Gly Glu Cys Gly
765 770 775

Ser Gly Gly Gly Asp Ser Lys Asp Lys Thr Ser Ala Leu Ser Leu Ser
780 785 790

Asn Val Ala Gly Val Phe Tyr Ile Leu Val Gly Gly Leu Gly Leu Ala
795 800 805 810



Met Met Val Ala Leu Ile Glu Phe Cys Tyr Lys Ser Arg Ala Glu Ser
815 820 825

Lys Arg Met Lys Leu Thr Lys Asn Thr Gln Asn Phe Lys Pro Ala Pro
830 835 840

Ala Thr Asn Thr Gln Asn Tyr Ala Thr Tyr Arg Glu Gly Tyr Asn Val
845 850 855

Tyr Gly Thr Glu Ser Val Lys Ile
860 865



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3070 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 79..144

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 145..2745

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 79..2745

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GAATTCCTGA CGACTCCTGA GTTGCGCCCA TGCTCTTGTC AGCTTCGTTT TAGGCGTAGC 60

ATGGCCAGGC AGAAGAAA ATG GGG CAA AGC GTG CTC CGG GCG GTC TTC TTT 111
Met Gly Gln Ser Val Leu Arg Ala Val Phe Phe
-22 -20 -15

TTA GTC CTG GGG CTT TTG GGT CAT TCT CAC GGA GGA TTC CCC AAC ACC 159
Leu Val Leu Gly Leu Leu Gly His Ser His Gly Gly Phe Pro Asn Thr
-10 -5 1 5

ATC AGC ATA GGT GGA CTT TTC ATG AGA AAC ACA GTG CAG GAG CAC AGC 207
Ile Ser Ile Gly Gly Leu Phe Met Arg Asn Thr Val Gln Glu His Ser
10 15 20

GCT TTC CGC TTT GCC GTG CAG TTA TAC AAC ACC AAC CAG AAC ACC ACC 255
Ala Phe Arg Phe Ala Val Gln Leu Tyr Asn Thr Asn Gln Asn Thr Thr
25 30 35

GAG AAG CCC TTC CAT TTG AAT TAC CAC GTA GAT CAC TTG GAT TCC TCC 303
Glu Lys Pro Phe His Leu Asn Tyr His Val Asp His Leu Asp Ser Ser
40 45 50

AAT AGT TTT TCC GTG ACA AAT GCT TTC TGC TCC CAG TTC TCG AGA GGG 351
Asn Ser Phe Ser Val Thr Asn Ala Phe Cys Ser Gln Phe Ser Arg Gly
55 60 65



GTG TAT GCC ATC TTT GGA TTC TAT GAC CAG ATG TCA ATG AAC ACC CTG 399
Val Tyr Ala Ile Phe Gly Phe Tyr Asp Gln Met Ser Met Asn Thr Leu
70 75 80 85

ACC TCC TTC TGT GGG GCC CTG CAC ACA TCC TTT GTT ACG CCT AGC TTC 447
Thr Ser Phe Cys Gly Ala Leu His Thr Ser Phe Val Thr Pro Ser Phe
90 95 100

CCC ACT GAC GCA GAT GTG CAG TTT GTC ATC CAG ATG CGC CCA GCC TTG 495
Pro Thr Asp Ala Asp Val Gln Phe Val Ile Gln Met Arg Pro Ala Leu
105 110 115

AAG GGC GCT ATT CTG AGT CTT CTG GGT CAT TAC AAG TGG GAG AAG TTT 543
Lys Gly Ala Ile Leu Ser Leu Leu Gly His Tyr Lys Trp Glu Lys Phe
120 125 130

GTG TAC CTC TAT GAC ACA GAA CGA GGA TTT TCC ATC CTC CAA GCG ATT 591
Val Tyr Leu Tyr Asp Thr Glu Arg Gly Phe Ser Ile Leu Gln Ala Ile
135 140 145

ATG GAA GCA GCA GTG CAA AAC AAC TGG CAA GTA ACA GCA AGG TCT GTG 639
Met Glu Ala Ala Val Gln Asn Asn Trp Gln Val Thr Ala Arg Ser Val
150 155 160 165

GGA AAC ATA AAG GAC GTC CAA GAA TTC AGG CGC ATC ATT GAA GAA ATG 687
Gly Asn Ile Lys Asp Val Gln Glu Phe Arg Arg Ile Ile Glu Glu Met
170 175 180

GAC AGG AGG CAG GAA AAG CGA TAC TTG ATT GAC TGC GAA GTC GAA AGG 735
Asp Arg Arg Gln Glu Lys Arg Tyr Leu Ile Asp Cys Glu Val Glu Arg
185 190 195

ATT AAC ACA ATT TTG GAA CAG GTT GTG ATC CTA GGG AAA CAC TCA AGA 783
Ile Asn Thr Ile Leu Glu Gln Val Val Ile Leu Gly Lys His Ser Arg
200 205 210

GGT TAT CAC TAC ATG CTC GCT AAC CTG GGT TTT ACT GAT ATT TTA CTG 831
Gly Tyr His Tyr Met Leu Ala Asn Leu Gly Phe Thr Asp Ile Leu Leu
215 220 225

GAA AGA GTC ATG CAT GGG GGA GCC AAC ATT ACA GGT TTC CAG ATT GTC 879
Glu Arg Val Met His Gly Gly Ala Asn Ile Thr Gly Phe Gln Ile Val
230 235 240 245

AAC AAT GAA AAC CCT ATG GTT CAG CAG TTC ATA CAG CGC TGG GTG AGG 927
Asn Asn Glu Asn Pro Met Val Gln Gln Phe Ile Gln Arg Trp Val Arg
250 255 260

CTG GAT GAA AGG GAA TTC CCT GAA GCC AAG AAT GCA CCA CTA AAG TAT 975
Leu Asp Glu Arg Glu Phe Pro Glu Ala Lys Asn Ala Pro Leu Lys Tyr
265 270 275

ACA TCT GCA TTG ACA CAC GAC GCA ATA CTG GTC ATA GCA GAA GCT TTC 1023
Thr Ser Ala Leu Thr His Asp Ala Ile Leu Val Ile Ala Glu Ala Phe
280 285 290

CGC TAC CTG AGG AGG CAG CGA GTA GAT GTG TCC CGG AGA GGA AGT GCT 1071
Arg Tyr Leu Arg Arg Gln Arg Val Asp Val Ser Arg Arg Gly Ser Ala
295 300 305

GGA GAC TGC TTA GCA AAT CCT GCT GTG CCC TGG AGT CAA GGA ATT GAT 1119
Gly Asp Cys Leu Ala Asn Pro Ala Val Pro Trp Ser Gln Gly Ile Asp
310 315 320 325

ATT GAG AGA GCT CTG AAA ATG GTG CAA GTA CAA GGA ATG ACT GGA AAT 1167
Ile Glu Arg Ala Leu Lys Met Val Gln Val Gln Gly Met Thr Gly Asn
330 335 340

ATT CAA TTT GAC ACT TAT GGA CGT AGG ACA AAT TAT ACC ATC GAT GTG 1215
Ile Gln Phe Asp Thr Tyr Gly Arg Arg Thr Asn Tyr Thr Ile Asp Val
345 350 355

TAT GAA ATG AAA GTC AGT GGC TCT CGA AAA GCT GGC TAC TGG AAC GAG 1263
Tyr Glu Met Lys Val Ser Gly Ser Arg Lys Ala Gly Tyr Trp Asn Glu
360 365 370

TAT GAA AGG TTT GTG CCT TTC TCA GAT CAG CAA ATC AGC AAT GAC AGT 1311
Tyr Glu Arg Phe Val Pro Phe Ser Asp Gln Gln Ile Ser Asn Asp Ser
375 380 385

GCA TCC TCA GAG AAT CGG ACC ATA GTA GTG ACT ACC ATT CTG GAA TCA 1359
Ala Ser Ser Glu Asn Arg Thr Ile Val Val Thr Thr Ile Leu Glu Ser
390 395 400 405

CCA TAT GTA ATG TAC AAG AAG AAC CAT GAG CAA CTG GAA GGA AAT GAA 1407
Pro Tyr Val Met Tyr Lys Lys Asn His Glu Gln Leu Glu Gly Asn Glu
410 415 420

CGA TAT GAA GGC TAT TGT GTA GAC CTA GCC TAT GAA ATA GCC AAA CAT 1455
Arg Tyr Glu Gly Tyr Cys Val Asp Leu Ala Tyr Glu Ile Ala Lys His
425 430 435

GTA AGG ATC AAA TAC AAA TTG TCC ATC GTT GGT GAC GGG AAA TAT GGT 1503
Val Arg Ile Lys Tyr Lys Leu Ser Ile Val Gly Asp Gly Lys Tyr Gly
440 445 450

GCA AGG GAT CCA GAG ACT AAA ATA TGG AAC GGC ATG GTT GGG GAA CTT 1551
Ala Arg Asp Pro Glu Thr Lys Ile Trp Asn Gly Met Val Gly Glu Leu
455 460 465

GTC TAT GGG AGA GCT GAT ATA GCT GTT GCT CCA CTC ACT ATA ACA TTG 1599
Val Tyr Gly Arg Ala Asp Ile Ala Val Ala Pro Leu Thr Ile Thr Leu
470 475 480 485

GTC CGT GAA GAA GTC ATA GAT TTT TCA AAG CCA TTA ATG AGC CTG GGC 1647
Val Arg Glu Glu Val Ile Asp Phe Ser Lys Pro Leu Met Ser Leu Gly
490 495 500

ATC TCC ATC ATG ATA AAG AAG CCT CAG AAA TCA AAA CCA GGC GTA TTC 1695
Ile Ser Ile Met Ile Lys Lys Pro Gln Lys Ser Lys Pro Gly Val Phe
505 510 515

TCA TTT CTG GAT CCC CTG GCT TAT GAA ATC TGG ATG TGC ATT GTC TTT 1743
Ser Phe Leu Asp Pro Leu Ala Tyr Glu Ile Trp Met Cys Ile Val Phe
520 525 530

GCT TAC ATT GGA GTC AGC GTA GTT CTT TTC CTA GTC AGC AGG TTC AGT 1791
Ala Tyr Ile Gly Val Ser Val Val Leu Phe Leu Val Ser Arg Phe Ser
535 540 545

CCT TAT GAA TGG CAC TTG GAA GAC AAC AAT GAA GAA CCT CGT GAC CCA 1839
Pro Tyr Glu Trp His Leu Glu Asp Asn Asn Glu Glu Pro Arg Asp Pro
550 555 560 565

CAA AGT CCT CCT GAT CCT CCA AAT GAA TTT GGA ATA TTT AAC AGT CTT 1887
Gln Ser Pro Pro Asp Pro Pro Asn Glu Phe Gly Ile Phe Asn Ser Leu
570 575 580

TGG TTT TCC TTG GGT GCC TTT ATG CAG CAA GGA TGT GAT ATT TCT CCA 1935
Trp Phe Ser Leu Gly Ala Phe Met Gln Gln Gly Cys Asp Ile Ser Pro
585 590 595

AGA TCA CTC TCC GGG CGC ATT GTT GGA GGG GTT TGG TGG TTC TTC ACC 1983
Arg Ser Leu Ser Gly Arg Ile Val Gly Gly Val Trp Trp Phe Phe Thr
600 605 610

CTG ATC ATA ATT TCT TCC TAT ACT GCC AAT CTC GCT GCT TTC CTG ACT 2031
Leu Ile Ile Ile Ser Ser Tyr Thr Ala Asn Leu Ala Ala Phe Leu Thr
615 620 625

GTG GAG AGG ATG GTT TCT CCC ATA GAG AGT GCT GAA GAC TTA GCT AAA 2079
Val Glu Arg Met Val Ser Pro Ile Glu Ser Ala Glu Asp Leu Ala Lys
630 635 640 645

CAG ACT GAA ATT GCA TAT GGG ACC CTG GAC TCC GGT TCA ACA AAA GAA 2127
Gln Thr Glu Ile Ala Tyr Gly Thr Leu Asp Ser Gly Ser Thr Lys Glu
650 655 660

TTT TTC AGA AGA TCC AAA ATT GCT GTG TAC GAG AAA ATG TGG TCT TAC 2175
Phe Phe Arg Arg Ser Lys Ile Ala Val Tyr Glu Lys Met Trp Ser Tyr
665 670 675

ATG AAA TCA GCG GAG CCA TCT GTG TTT ACC AAA ACA ACA GCA GAC GGA 2223
Met Lys Ser Ala Glu Pro Ser Val Phe Thr Lys Thr Thr Ala Asp Gly
680 685 690

GTG GCC CGA GTG CGA AAG TCC AAG GGA AAG TTC GCC TTC CTG CTG GAG 2271
Val Ala Arg Val Arg Lys Ser Lys Gly Lys Phe Ala Phe Leu Leu Glu
695 700 705

TCA ACC ATG AAT GAG TAC ATT GAG CAG AGA AAA CCA TGT GAT ACG ATG 2319
Ser Thr Met Asn Glu Tyr Ile Glu Gln Arg Lys Pro Cys Asp Thr Met
710 715 720 725

AAA GTT GGT GGA AAT CTG GAT TCC AAA GGC TAT GGT GTG GCA ACC CCT 2367
Lys Val Gly Gly Asn Leu Asp Ser Lys Gly Tyr Gly Val Ala Thr Pro
730 735 740

AAA GGC TCA GCA TTA GGA ACG CCT GTA AAC CTT GCA GTA TTG AAA CTC 2415
Lys Gly Ser Ala Leu Gly Thr Pro Val Asn Leu Ala Val Leu Lys Leu
745 750 755

AGT GAA CAA GGC ATC TTA GAC AAG CTC AAA AAC AAA TGG TGG TAC GAT 2463
Ser Glu Gln Gly Ile Leu Asp Lys Leu Lys Asn Lys Trp Trp Tyr Asp
760 765 770

AAG GGG GAA TGT GGA GCC AAG GAC TCC GGG AGT AAG GAC AAG ACC AGC 2511
Lys Gly Glu Cys Gly Ala Lys Asp Ser Gly Ser Lys Asp Lys Thr Ser
775 780 785

GCT CTG AGC CTC AGC AAT GTG GCA GGC GTT TTC TAT ATA CTT GTC GGA 2559
Ala Leu Ser Leu Ser Asn Val Ala Gly Val Phe Tyr Ile Leu Val Gly
790 795 800 805

GGT CTG GGC CTG GCC ATG ATG GTG GCT TTG ATA GAA TTC TGT TAC AAA 2607
Gly Leu Gly Leu Ala Met Met Val Ala Leu Ile Glu Phe Cys Tyr Lys
810 815 820

TCA CGG GCA GAG TCC AAA CGC ATG AAA CTC ACA AAG AAC ACC CAA AAC 2655
Ser Arg Ala Glu Ser Lys Arg Met Lys Leu Thr Lys Asn Thr Gln Asn
825 830 835

TTT AAG CCT GCT CCT GCC ACC AAC ACT CAG AAT TAT GCT ACA TAC AGA 2703
Phe Lys Pro Ala Pro Ala Thr Asn Thr Gln Asn Tyr Ala Thr Tyr Arg
840 845 850



GAA GGC TAC AAC GTC TAT GGA ACA GAG AGT GTT AAG ATC TAGGGATCCC 2752
Glu Gly Tyr Asn Val Tyr Gly Thr Glu Ser Val Lys Ile
855 860 865

TTCCCACTGC AGGCATGTGA TGACAGCAAA TCACCGAAAA CGTGCCTGCT TCAAGGATCC 2812 

TGAGCCAGAT TTCACTCTCC TTGCTGTCGG GCATCACACC AATATTGCTG ATGGTGCAAT 2872 

GACCTTTCAA TACCAAAAAC TGATTTTTTT TTTCCTTCAG TGCCTTATGG AACACTCTGA 2932 

GACTCGCGAC AATGCAAACC ATCATTGAAA TCTTTTTCCT TTGCTTGAAA AAAAATAATT 2992 

AAAATAAAAA CCAACAAAAA TCCACATCCA TCAAACCCTT CATGTATTAA TATTTATTAT 3052 

AGTTTTCATT AGGAATTC 3070 
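
Because each coding line of the listing pairs codons with the residues they encode, a scanned copy can be proofread by translating the codons and comparing the result with the printed residues. The Python sketch below is illustrative only and is not part of the application as filed. It assumes the standard genetic code, and the helper names `translate` and `mismatches` are hypothetical. The example codons are the first eleven codons of the coding region of SEQ ID NO: 7.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(codons):
    """Translate a whitespace-separated codon string into three-letter codes."""
    return [THREE_LETTER[CODON_TABLE[c]] for c in codons.split()]


def mismatches(codons, residues):
    """Return (index, codon, printed, translated) where the two lines disagree."""
    printed = residues.split()
    return [
        (i, codon, shown, expected)
        for i, (codon, shown, expected) in enumerate(
            zip(codons.split(), printed, translate(codons))
        )
        if shown != expected
    ]


codons = "ATG GGG CAA AGC GTG CTC CGG GCG GTC TTC TTT"
residues = "Met Gly Gln Ser Val Leu Arg Ala Val Phe Phe"
print(translate(codons))
print(mismatches(codons, residues))
```

A non-empty result from `mismatches` only flags a disagreement between the two printed lines; it does not say which of the two lines was misread.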



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 888 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:



Met Gly Gln Ser Val Leu Arg Ala Val Phe Phe Leu Val Leu Gly Leu
-22 -20 -15 -10

Leu Gly His Ser His Gly Gly Phe Pro Asn Thr Ile Ser Ile Gly Gly
-5 1 5 10

Leu Phe Met Arg Asn Thr Val Gln Glu His Ser Ala Phe Arg Phe Ala
15 20 25

Val Gln Leu Tyr Asn Thr Asn Gln Asn Thr Thr Glu Lys Pro Phe His
30 35 40

Leu Asn Tyr His Val Asp His Leu Asp Ser Ser Asn Ser Phe Ser Val
45 50 55

Thr Asn Ala Phe Cys Ser Gln Phe Ser Arg Gly Val Tyr Ala Ile Phe
60 65 70

Gly Phe Tyr Asp Gln Met Ser Met Asn Thr Leu Thr Ser Phe Cys Gly
75 80 85 90

Ala Leu His Thr Ser Phe Val Thr Pro Ser Phe Pro Thr Asp Ala Asp
95 100 105

Val Gln Phe Val Ile Gln Met Arg Pro Ala Leu Lys Gly Ala Ile Leu
110 115 120

Ser Leu Leu Gly His Tyr Lys Trp Glu Lys Phe Val Tyr Leu Tyr Asp
125 130 135

Thr Glu Arg Gly Phe Ser Ile Leu Gln Ala Ile Met Glu Ala Ala Val
140 145 150

Gln Asn Asn Trp Gln Val Thr Ala Arg Ser Val Gly Asn Ile Lys Asp
155 160 165 170

Val Gln Glu Phe Arg Arg Ile Ile Glu Glu Met Asp Arg Arg Gln Glu
175 180 185

Lys Arg Tyr Leu Ile Asp Cys Glu Val Glu Arg Ile Asn Thr Ile Leu
190 195 200

Glu Gln Val Val Ile Leu Gly Lys His Ser Arg Gly Tyr His Tyr Met
205 210 215

Leu Ala Asn Leu Gly Phe Thr Asp Ile Leu Leu Glu Arg Val Met His
220 225 230

Gly Gly Ala Asn Ile Thr Gly Phe Gln Ile Val Asn Asn Glu Asn Pro
235 240 245 250

Met Val Gln Gln Phe Ile Gln Arg Trp Val Arg Leu Asp Glu Arg Glu
255 260 265

Phe Pro Glu Ala Lys Asn Ala Pro Leu Lys Tyr Thr Ser Ala Leu Thr
270 275 280

His Asp Ala Ile Leu Val Ile Ala Glu Ala Phe Arg Tyr Leu Arg Arg
285 290 295

Gln Arg Val Asp Val Ser Arg Arg Gly Ser Ala Gly Asp Cys Leu Ala
300 305 310

Asn Pro Ala Val Pro Trp Ser Gln Gly Ile Asp Ile Glu Arg Ala Leu
315 320 325 330

Lys Met Val Gln Val Gln Gly Met Thr Gly Asn Ile Gln Phe Asp Thr
335 340 345

Tyr Gly Arg Arg Thr Asn Tyr Thr Ile Asp Val Tyr Glu Met Lys Val
350 355 360

Ser Gly Ser Arg Lys Ala Gly Tyr Trp Asn Glu Tyr Glu Arg Phe Val
365 370 375

Pro Phe Ser Asp Gln Gln Ile Ser Asn Asp Ser Ala Ser Ser Glu Asn
380 385 390

Arg Thr Ile Val Val Thr Thr Ile Leu Glu Ser Pro Tyr Val Met Tyr
395 400 405 410

Lys Lys Asn His Glu Gln Leu Glu Gly Asn Glu Arg Tyr Glu Gly Tyr
415 420 425

Cys Val Asp Leu Ala Tyr Glu Ile Ala Lys His Val Arg Ile Lys Tyr
430 435 440

Lys Leu Ser Ile Val Gly Asp Gly Lys Tyr Gly Ala Arg Asp Pro Glu
445 450 455

Thr Lys Ile Trp Asn Gly Met Val Gly Glu Leu Val Tyr Gly Arg Ala
460 465 470

Asp Ile Ala Val Ala Pro Leu Thr Ile Thr Leu Val Arg Glu Glu Val
475 480 485 490

Ile Asp Phe Ser Lys Pro Leu Met Ser Leu Gly Ile Ser Ile Met Ile
495 500 505

Lys Lys Pro Gln Lys Ser Lys Pro Gly Val Phe Ser Phe Leu Asp Pro
510 515 520

Leu Ala Tyr Glu Ile Trp Met Cys Ile Val Phe Ala Tyr Ile Gly Val
525 530 535

Ser Val Val Leu Phe Leu Val Ser Arg Phe Ser Pro Tyr Glu Trp His
540 545 550

Leu Glu Asp Asn Asn Glu Glu Pro Arg Asp Pro Gln Ser Pro Pro Asp
555 560 565 570

Pro Pro Asn Glu Phe Gly Ile Phe Asn Ser Leu Trp Phe Ser Leu Gly
575 580 585

Ala Phe Met Gln Gln Gly Cys Asp Ile Ser Pro Arg Ser Leu Ser Gly
590 595 600

Arg Ile Val Gly Gly Val Trp Trp Phe Phe Thr Leu Ile Ile Ile Ser
605 610 615

Ser Tyr Thr Ala Asn Leu Ala Ala Phe Leu Thr Val Glu Arg Met Val
620 625 630

Ser Pro Ile Glu Ser Ala Glu Asp Leu Ala Lys Gln Thr Glu Ile Ala
635 640 645 650

Tyr Gly Thr Leu Asp Ser Gly Ser Thr Lys Glu Phe Phe Arg Arg Ser
655 660 665

Lys Ile Ala Val Tyr Glu Lys Met Trp Ser Tyr Met Lys Ser Ala Glu
670 675 680

Pro Ser Val Phe Thr Lys Thr Thr Ala Asp Gly Val Ala Arg Val Arg
685 690 695

Lys Ser Lys Gly Lys Phe Ala Phe Leu Leu Glu Ser Thr Met Asn Glu
700 705 710

Tyr Ile Glu Gln Arg Lys Pro Cys Asp Thr Met Lys Val Gly Gly Asn
715 720 725 730

Leu Asp Ser Lys Gly Tyr Gly Val Ala Thr Pro Lys Gly Ser Ala Leu

735 740 745

Gly Thr Pro Val Asn Leu Ala Val Leu Lys Leu Ser Glu Gln Gly Ile
750 755 760

Leu Asp Lys Leu Lys Asn Lys Trp Trp Tyr Asp Lys Gly Glu Cys Gly
765 770 775

Ala Lys Asp Ser Gly Ser Lys Asp Lys Thr Ser Ala Leu Ser Leu Ser
780 785 790

Asn Val Ala Gly Val Phe Tyr Ile Leu Val Gly Gly Leu Gly Leu Ala
795 800 805 810

Met Met Val Ala Leu Ile Glu Phe Cys Tyr Lys Ser Arg Ala Glu Ser
815 820 825

Lys Arg Met Lys Leu Thr Lys Asn Thr Gln Asn Phe Lys Pro Ala Pro
830 835 840

Ala Thr Asn Thr Gln Asn Tyr Ala Thr Tyr Arg Glu Gly Tyr Asn Val
845 850 855

Tyr Gly Thr Glu Ser Val Lys Ile
860 865

(2) INFORMATION FOR SEQ ID NO: 9; 

25 (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Gly Ser Ala Leu Gly Asn Ala Val Asn Leu Ala Val Leu Lys Leu Asn
1 5 10 15

Glu Gln Gly Leu Leu Asp Lys Leu Lys Asn Lys Trp Trp Tyr Asp Lys
20 25 30

Gly Glu Cys Gly Ser Gly Gly Gly Asp Ser Lys Asp Lys Thr
35 40 45



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Ser Ala Leu Gly Thr Pro Val Asn Leu Ala Val Leu Lys Leu Ser
1 5 10 15

Glu Gln Gly Ile Leu Asp Lys Leu Lys Asn Lys Trp Trp Tyr Asp Lys
20 25 30

Gly Glu Cys Gly Ala Lys Asp Ser Gly Ser Lys Asp Lys Thr
35 40 45
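
SEQ ID NOs: 9 and 10 are both listed as 46-residue peptides, so they can be compared position by position. The following is a minimal Python sketch of such a comparison, not part of the listing itself. The residue strings are transcribed from the two listings above with scan artifacts corrected, and the helper name differing_positions is hypothetical.

```python
# Residues transcribed from SEQ ID NO: 9 and SEQ ID NO: 10 as listed above
# (three-letter codes; transcription is an assumption of this sketch).
SEQ_ID_9 = (
    "Gly Ser Ala Leu Gly Asn Ala Val Asn Leu Ala Val Leu Lys Leu Asn "
    "Glu Gln Gly Leu Leu Asp Lys Leu Lys Asn Lys Trp Trp Tyr Asp Lys "
    "Gly Glu Cys Gly Ser Gly Gly Gly Asp Ser Lys Asp Lys Thr"
).split()

SEQ_ID_10 = (
    "Gly Ser Ala Leu Gly Thr Pro Val Asn Leu Ala Val Leu Lys Leu Ser "
    "Glu Gln Gly Ile Leu Asp Lys Leu Lys Asn Lys Trp Trp Tyr Asp Lys "
    "Gly Glu Cys Gly Ala Lys Asp Ser Gly Ser Lys Asp Lys Thr"
).split()


def differing_positions(a, b):
    """Return (position, residue_a, residue_b) for each 1-based mismatch."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


if __name__ == "__main__":
    # Print one line per position at which the two peptides differ.
    for pos, res9, res10 in differing_positions(SEQ_ID_9, SEQ_ID_10):
        print(f"{pos:2d}  {res9} -> {res10}")
```

The comparison is a plain ungapped one, which is adequate here only because the two peptides have the same declared length.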

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Synthetic DNA oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
AGCTTGCGGC CGC 13 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Synthetic DNA oligonucleotide 

(iv) ANTI-SENSE: YES



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GCGGCCGCA 

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Synthetic DNA oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ACACTCAGAA TTACGCTACA TACAGAGAAG GCTACAACGT 40 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA oligonucleotide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
CCAGATCGAT ATTGTGAACA TCAGCGACAC GTTTGAGATG 40 
(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid;

(A) DESCRIPTION: Synthetic DNA oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTGAATGTGG AGCCAAGGAC TCGGGAAGTA AG 32
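
The declared lengths of the synthetic oligonucleotides in SEQ ID NOs: 11 to 15 can be checked against the printed sequences with a short script. The following is a minimal Python sketch, not part of the listing: the sequences are transcribed from the entries above, and the names OLIGOS, normalize, reverse_complement and length_mismatches are hypothetical. It also shows that the reverse complement of the anti-sense oligonucleotide of SEQ ID NO: 12 matches the last nine bases of SEQ ID NO: 11, an observation drawn only from the sequences as printed.

```python
# SEQ ID NO -> (sequence as printed, declared length in bases).
# Transcribed from the listing above; spaces are the printed 10-base groups.
OLIGOS = {
    11: ("AGCTTGCGGC CGC", 13),
    12: ("GCGGCCGCA", 9),
    13: ("ACACTCAGAA TTACGCTACA TACAGAGAAG GCTACAACGT", 40),
    14: ("CCAGATCGAT ATTGTGAACA TCAGCGACAC GTTTGAGATG", 40),
    15: ("GTGAATGTGG AGCCAAGGAC TCGGGAAGTA AG", 32),
}

# Watson-Crick base pairing for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def normalize(seq):
    """Remove the printed grouping spaces and upper-case the sequence."""
    return seq.replace(" ", "").upper()


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return normalize(seq).translate(COMPLEMENT)[::-1]


def length_mismatches(oligos):
    """Map SEQ ID NO -> (actual, declared) for entries whose lengths disagree."""
    result = {}
    for seq_id, (seq, declared) in oligos.items():
        actual = len(normalize(seq))
        if actual != declared:
            result[seq_id] = (actual, declared)
    return result


if __name__ == "__main__":
    print("length mismatches:", length_mismatches(OLIGOS))
    rc12 = reverse_complement(OLIGOS[12][0])
    seq11 = normalize(OLIGOS[11][0])
    # True when SEQ ID NO: 12 can pair with the 3' end of SEQ ID NO: 11.
    print("SEQ 12 pairs with 3' end of SEQ 11:", seq11.endswith(rc12))
```

If the two oligonucleotides are annealed in this register, the first four bases of SEQ ID NO: 11 remain unpaired.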



Claims 

1. An isolated polynucleotide comprising a region which encodes an AMPA-binding human GluR receptor
selected from the group consisting of human GluR1B, GluR2B, GluR3A and GluR3B receptors, and AMPA-binding
fragments thereof.

2. An isolated polynucleotide according to claim 1, which encodes said GluR1B receptor, said GluR2B receptor,
said GluR3A receptor or said GluR3B receptor.

3. An isolated polynucleotide comprising a region which encodes an AMPA-binding variant of a GluR receptor
selected from the group consisting of human GluR1B, GluR2B, GluR3A and GluR3B receptors, wherein
said variant has the binding profile of said receptor and varies from said receptor by conservative amino
acid substitution.

4. An isolated polynucleotide according to any one of claims 1 to 3, which consists of DNA. 

5. A recombinant DNA construct having incorporated therein a polynucleotide as defined in any one of claims 
1 to 4. 


6. A recombinant DNA construct according to claim 5, wherein the polynucleotide incorporated therein is 
linked operably with DNA enabling expression and secretion of said receptor in a cellular host. 

7. A recombinant DNA construct according to claim 5, which is plasmid pBS/humGluR3A (ATCC 75218), plasmid
pBS/humGluR3B (ATCC 75219), plasmid pBS/humGluR1B (ATCC 75246) or plasmid pBS/humGluR2B (ATCC 75217).

8. A cellular host having incorporated therein a heterologous polynucleotide as defined in any one of claims 
1 to 4. 


9. A cellular host according to claim 8, which is a mammalian cell. 

10. An AMPA-binding membrane preparation derived from a cellular host as defined in claim 8 or claim 9. 

11. A process for obtaining a substantially homogeneous source of human GluR receptor, which comprises
the step of culturing a cellular host as defined in claim 8 or claim 9, and then recovering the cells so cultured.

12. A process for obtaining a substantially homogeneous source of human GluR receptor according to claim
11 comprising the subsequent step of obtaining a membrane preparation from the cultured cells.

13. A method of assaying a substance for binding to a human EAA receptor, which comprises the steps of
incubating the substance under appropriate conditions with a cellular host as defined in claim 8 or claim
9, or with an AMPA-binding membrane preparation derived therefrom, and determining the extent of binding
between the human GluR receptor and the substance.

14. An isolated human GluR receptor selected from the group consisting of GluR1B, GluR2B, GluR3A and
GluR3B receptors, and AMPA-binding fragments thereof, in a form essentially free from other proteins of
human origin.

15. An AMPA-binding fragment of a human GluR receptor selected from the group consisting of GluR1B,
GluR2B, GluR3A and GluR3B receptors.

16. An antibody which binds a human GluR receptor selected from the group consisting of GluR1B, GluR2B,
GluR3A and GluR3B receptors.

17. An immunogenic fragment of a human GluR receptor selected from the group consisting of GluR1B,
GluR2B, GluR3A and GluR3B receptors.

18. An oligonucleotide which comprises at least about 17 nucleic acids and which hybridizes selectively with 
a polynucleotide defined in any one of claims 1 to 4. 